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Sir: 



In regard to the above-referenced application, Appellant submits this 
Appeal Brief. 



I. REAL PARTY IN INTEREST 

Deltagen, Inc., the assignee, is the real party in interest. The right of 
Deltagen, Inc. to take action in the subject application was established by virtue 
of the assignment from the inventor, Keith D. Allen, to Deltagen, Inc., recorded at s 
Reel 012697, Frame 0388. 1 
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judicial proceedings known to Appellant, the Appellant's legal representative, or 
assignee which are related to, directly affect or are directly affected by or have a 
bearing on the Board's decision in the present appeal 

III. STATUS OF CLAIMS 

Claims 1-27, 33-36, 38-46, 48-52 and 58 have been canceled, and claims 28- 
32, 37, 47 and 53-57 are pending. Claims 28-32, 37, 47 and 53-57 are appealed. 

IV. STATUS OF AMENDMENT 

An after final Supplementary Amendment was filed September 8, 2005 
amending claims 28-32 and 55-57, and canceling claims 52 and 58. This 
amendment was entered by the Examiner. Following entry of this amendment, 
claims 28-32, 37, 47 and 53-57 are pending and rejected. 

V. SUMMARY OF CLAIMED SUBJECT MATTER 

Claim 28, the sole independent claim, is drawn to a transgenic mouse 
whose genome comprises a null allele in the endogenous PTP36 gene. A 
transgenic mouse is a mouse whose genome has been modified such that a 
specific gene (e.g., the PTP36 gene) has been disrupted by the method of gene 
targeting. Specification, page 6, lines 29-30. The term "gene targeting" refers to a 
type of homologous recombination that occurs when a fragment of exogenous 
DNA (e.g., a targeting vector; see Figure 4) is introduced into a mammalian cell 
and that exogenous DNA locates and recombines with the endogenous 
homologous sequences. Specification, page 5, lines 23-25. A null allele is 
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disruption of the gene in which there is no significant expression of the PTP36 
gene. Specification, page 6, lines 23-24. A normal or wild-type mouse has two 
(2) copies of the PTP36 gene, each referred to as a wild-type allele. The claimed 
transgenic mouse includes both the heterozygous animal (i.e., one defective 
PTP36 allele and one wild-type PTP36 allele) and the homozygous animal (i.e., 
two defective PTP36 alleles). Id., page 6, lines 28- page 7, lines 2. By disrupting 
or "knocking out 1 the gene, the function of the PTP36 gene can be determined by 
comparing the transgenic mouse with a wild-type control mouse. Id., page 14, 
lines 5-10. 

Claim 37 is drawn to a cell or tissue isolated from the transgenic mouse of 
claim 28. Specification, page 2, lines 20-23; page 32, lines 22-27. 

Claim 47 is drawn to a method of producing the transgenic mouse of 
claim 28. The method comprises the steps of introducing a targeting construct 
capable of disrupting an endogenous PTP36 allele into a mouse embryonic stem 
cell; selected for the mouse embryonic stem that has undergone homologous 
recombination; introducing the mouse embryonic stem cell selected for into a 
blastocyst; implanting the resulting blastocyst into a pseudopregnant mouse, 
wherein the resultant mouse gives birth to a chimeric mouse; and breeding the 
chimeric mouse to produce the transgenic mouse. Specification, page 8, line 10- 
page 14, line 4. 

Claim 53, dependent from claim 28, is drawn to the transgenic mouse of 
claim 28 wherein the mouse is heterozygous for the null allele. Specification, 
page 14, lines 1-2; page 48, line 28- page 49, line 2; page 50, lines 10-11. 

Claim 54, dependent from claim 28, is drawn to the transgenic mouse of 
claim 28 wherein the mouse is homozygous for the null allele. Specification, 
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page 3, lines 5-6; specification, page 6, line 28- page 7, line 2; page 12, lines 1-4; 
page 49, lines 1-27. The inherent properties of the homozygous mouse of claim 
54 are recited in dependent claims 29-32. 

Claim 55, dependent from claim 28, is drawn to the transgenic mouse of 
claim 28 wherein the null allele comprises a gene encoding a selection marker. A 
"selection marker" refers to a gene encoding a product that enables only the 
cells that carry the gene to survive and/or grow under certain conditions. 
Specification page 5, lines 3-8. 

Claim 56 is drawn to the transgenic mouse of claim 55 wherein the gene 
encoding a selection marker is a neomycin resistance gene. The neomycin gene 
confers resistance to the compound G418 such that cells that do not carry the 
neomycin resistance gene are killed by G418. Specification, page 7, lines 3-9; 
Figure 4. 

Claim 57 is drawn to the transgenic mouse of claim 56 wherein the null 
allele further comprises a LacZ gene. See Figure 4; specification page 50, line 10- 
page 54, line 9. The LacZ gene allows one to determine where the PTP36 gene is 
expressed in heterozygous mice by staining with the X-Gal substrate. 
Specification, page 50, lines 10-13. 

Claims 29-31 are each drawn to the transgenic mouse of claim 54 wherein 
the mouse exhibits a specific uterine abnormality, relative to a wild-type control 
mouse. Specification page 49, lines 14-23. The uterine abnormality phenotype is 
an inherent property of the homozygous mouse of claim 54. 

Claim 32 is drawn to the transgenic mouse of claim 54 wherein the mouse 
exhibits, relative to a wild-type control mouse, increased organ weight in one or 
more of the specified organ systems. Increased organ weight in one or more of 
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the specified organ systems is an inherent property of the homozygous mouse of 
claim 54. 

VI. GROUNDS OF REJECTION TO BE REVIEWED ON APPEAL 

A. Whether Claims 28-32, 37, 47 and 53-57 are Unpatentable Under 
35 U.S.C. § 101 

Claims 28-32, 37, 47 and 53-57 are rejected under 35 U.S.C. § 101 on the 
grounds that the claimed invention is allegedly not supported by either a specific 
or substantial asserted utility or a well-established utility. 

B. Whether Claims 28-32, 37, 47 and 53-57 are Unpatentable Under 
35 U.S.C. § 112, first paragraph: Enablement 

Claims 28-32, 37, 47 and 53-57 are rejected under 35 U.S.C. § 112, first 
paragraph, on the grounds that the patent application does not enable one skilled 
in the art to use the claimed invention as a result of the alleged lack of either a 
specific or substantial asserted utility or a well-established utility. 

C Whether Claims 28-32, 37, 47, 53-54, and 57 are Unpatentable 
Under 35 U.S.C. § 112, first paragraph: Written Description 

Claims 28-32, 37, 47, 53-54, and 57 are rejected under 35 U.S.C. § 112, first 
paragraph, on the grounds that the claims contain subject matter which was not 
described in the specification in such a way as to reasonably convey to one 
skilled in the relevant art that the inventor, at the time of filing, had possession of 
the claimed invention. 
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D. Whether Claim 32 is Unpatentable Under 35 U.S.C. § 112, second 
paragraph: Definiteness 

Claim 32 is rejected under 35 U.S.C. § 112, second paragraph, as being 
indefinite for failing to point out and distinctly claim the subject matter which 
applicant regards as the invention. 

VII. ARGUMENT 

A. Technology Background 

The following steps are performed to obtain transgenic mice within the 
scope of the claims i.e., transgenic mice whose genomes comprise a null allele in 
the endogenous PTP36 gene. It is assumed that the Board is familiar with the 
basic principles of molecular genetics. 

1. A targeting vector is prepared. The targeting vector is a DNA 
construct that includes a LacZ-Neo cassette flanked on either side by sequences 
that are identical to sequences in the endogenous mouse PTP36 gene. These 
flanking sequences are referred to as "homology arms" and are depicted in 
Figure 4 of the application as including SEQ ID NO: 2 and SEQ ID NO: 3. The 
LacZ-Neo cassette includes a LacZ gene (which encodes the bacterial enzyme (3- 
galactosidase) and a Neo gene (which encodes a protein that confers resistance to 
the drug neomycin). The LacZ gene in the cassette lacks a promoter (the 
sequences required to direct transcription of the LacZ gene), whereas the Neo 
gene in the cassette is operatively linked to a promoter. See Figure 4. 

2. The targeting vector is introduced into mouse embryonic stem (ES) 
cells. ES cells are pluripotential cells isolated from mouse embryos that may be 
grown and manipulated in culture. The homology arms of the targeting vector 



6 



Express Mail Label No. EV780362226US 
Date of Deposit: November 16, 2005 

pair up with the complementary sequence in the endogenous mouse PTP36 gene. 
Through a process known as "homologous recombination," the targeting vector 
recombines with the PTP36 gene, precisely replacing specific bases of one 
chromosomal copy of the PTP36 gene with the LacZ-Neo cassette (the replaced 
bases of the PTP36 gene are depicted in Figure 2A (underlined nucleotides) and 
also in Figure 3 of the application). The replacement of these bases with the 
LacZ-Neo cassette ablates the function of the gene. The resulting disrupted 
PTP36 gene is referred to as a "null allele! 9 ES cells are then screened for 
resistance to the drug neomycin. If an ES cell is neomycin-resistant, this 
indicates that the ES cell expresses the Neo gene and, therefore, the ES cell has 
the LacZ-Neo cassette inserted into its genome. Disruption of the PTP36 gene in 
neomycin-resistant ES cells is then confirmed using the Polymerase Chain 
Reaction (PCR) technique and Southern blot analysis. 

3. ES cells in which the PTP36 gene has been disrupted by 
homologous recombination are then introduced into a host mouse blastocyst (a 
preimplantation embryo consisting of a hollow sphere of cells) obtained from the 
C57BL/6 mouse strain. C57BL/6 mice carry the black coat color allele. The ES 
cells are derived from the 129/OlaHsd mouse strain. 129/OlaHsd mice carry the 
agouti coat color allele. The manipulated blastocyst is then implanted into a 
pseudopregnant mouse (a female mouse that has been mated with a 
vasectomized male); the blastocyst develops and the female mouse gives birth to 
a chimeric mouse that developed from the manipulated blastocyst. Chimeric 
mice have regions of agouti colored fur— indicating tissue derived from the 
implanted ES cells— and regions of black fur— indicating tissue derived from the 
C57BL/6 host blastocyst. 
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4. Chimeric male mice are then mated with C57BL/6 females. If a 
chimeric male mouse has a germ line that is derived from an ES cell, then sperm 
from the chimeric mouse will carry the agouti coat allele found in the 
129/OlaHsd strain. The agouti coat allele is dominant over the black coat allele. 
As a result, if a chimeric male mouse has a germ line derived from an ES cell, 
then the progeny of that mouse will all have pure agouti coats. The agouti 
progeny, termed the Fl generation, receive one wild-type copy of the PTP36 gene 
from the C57BL/6 mother, and one copy of the PTP36 gene from the ES cell- 
derived germ line of their chimeric fathers. Because only one copy of the PTP36 
gene is knocked out in the ES cells, half of the Fl progeny inherit a disrupted 
copy of the PTP36 gene— a null allele— from their father, and half inherit a 
normal (wild-type) copy. In genetic nomenclature, the wild-type copy of the 
PTP36 gene is referred to as "+" and the null allele of the PTP36 gene is referred 
to as Hence, the Fl progeny are either +/+ or +/-. Mice that are +/- are 
referred to as "heterozygous mice" or as "heterozygotes" and are identified by 
genotyping techniques (Southern blot analysis and PCR analysis). 

5. Fl heterozygous mice are then backcrossed to C57BL/6 mice, and 
heterozygous male and female mice in the resulting litters are again selected. 
These heterozygous males and females are then intercrossed. The resulting mice 
are referred to as the F2 generation. One quarter of the F2 progeny are +/+, one 
quarter are heterozygotes (+/-) and one quarter have two copies of the null allele 
(-/-) of the PTP36 gene. Mice that are -/- are referred to as "homozygous mice" or 
as "homozygotes" The F2 heterozygous mice and homozygous mice are then 
phenotypically characterized and compared with age-matched and gender- 
matched F2 +/+ mice, referred to herein as "wild-type control mice" It typically 
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takes approximately one year from the initial targeting of the gene in ES cells to 
generate F2 homozygous mice. 

6. The expression pattern of the LacZ gene is determined in tissue 
samples from heterozygous mice. Because the LacZ gene does not have its own 
promoter, transcription of the LacZ gene is driven by the endogenous PTP36 
gene promoter. As a result, the LacZ gene is expressed in the same cells that the 
PTP36 gene is expressed in. LacZ expression is detected by contacting tissue 
from heterozygous mice with a chemical named "X-gal." (3-galactosidase (the 
enzyme encoded for by the LacZ gene) catalyzes the conversion of X-gal into a 
blue colored reaction product; the staining of tissue with this blue product is 
visualized using a microscope. 

B. Legal Standard 

1. General Principles 

(a) Utility under 35 U.S.C. § 101 

35 U.S.C. § 101 states that "[w]hoever invents... any new and useful 
. . .composition of matter . . .may obtain a patent therefore. . .." 35 U.S.C. § 101 
(emphais added). In Brenner v. Manson, 383 U.S. 519 (1966), the Supreme Court 
explained that 

[t]he basic quid pro quo contemplated by the Constitution and the 
Congress for granting a patent monopoly is the benefit derived by the 
public from an invention with substantial utility. Unless and until a 
process is refined and developed to this point - where specific benefit 
exists in currently available form - there is insufficient justification for 
permitting an applicant to engross what may prove to be a broad field. 

Brenner, 383 U.S. at 534-535 (emphases added). This language has 
subsequently been interpreted by the Federal Circuit to mean that an invention is 
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useful if it has both a "substantial" utility and a "specific" utility. See In re Fisher, 
421 F.3d 1365, 1371 (Fed. Or. 2005). 

The terms "practical utility" and "real world utility " are often used in 
determining whether an invention offers a "substantial" utility. See Nelson v. 
Bowler, 626 F.2d 853, 856, 206 USPQ 881, 883 (C.C.P.A. 1980). An invention has a 
practical and real world utility, and hence a substantial utility, when "one skilled 
in the art can use a claimed discovery in a manner which provides some 
immediate benefit to the public." In re Fisher, 421 F.3d at 1371 (citing Nelson v. 
Bowler, 626 F.2d 853, 856). Accordingly, for an invention to have a substantial 
utility, "an asserted use must show that the claimed invention has a significant 
and presently available benefit to the public." In re Fisher, 421 F.3d at 1371. 

With regard to the "specific" utility requirement, "an application must 
disclose a use which is not so vague as to be meaningless." Id. An asserted use 
must show that the "claimed invention can be used to provide a well-defined 
and particular benefit to the public." Id. A specific utility is particular to the 
subject matter claimed as compared to a "general utility" that would be 
applicable to the broad class of the invention. See id; see also U.S. Pat. & 
Trademark Off., Manual of Patent Examining Procedure § 2107.01 (8th ed. 2001, 
rev. May 2004). 

The Federal Circuit has recognized that "[t]he threshold of utility is not 
high." Juicy Whip v Orange Bang, 185 F.3d 1364, 1366 (Fed. Cir. 1999). Rather, an 
invention has utility "if it is capable of providing some identifiable benefit." Id. 
A finding of lack of utility is warranted only if the invention is "totally incapable 
of achieving a useful result." Brooktree Corp. v. Advanced Micro Devices, Inc., 977 
F.2d 1555, 1571 (Fed. Cir. 1992). Accordingly, utility must be found even if an 
invention is only partially successful in achieving a single credibly asserted use. 
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Moreover, utility exists even if further research and development on the claimed 
invention is expected. See In re Brana, 51 F.3d 1560, 1568 (Fed. Cir. 1995). 

An assertion of utility is presumed to be true. See In re Brana, 51 F.3d at 
1566 (citing In re Marzocchi, 439 F.2d 220, 223 (C.C.P.A. 1971)). From this it 
follows that the PTO has the initial burden of challenging as non-credible a 
presumptively correct assertion of utility in the disclosure. Only after the PTO 
provides evidence showing that one of ordinary skill in the art would reasonably 
doubt the credibility of the asserted utility does the burden shift to the applicant 
to provide rebuttal evidence sufficient to convince such a person of the 
invention's asserted utility. See id. If one skilled in the art would "immediately 
appreciate why" a claimed invention has a specific and substantial utility, then 
the invention has a "well-established utility, " notwithstanding the failure of the 
application to explicitly state that utility. See U.S. Pat. & Trademark Off., Manual 
of Patent Examining Procedure § 2107.02 II.B. 

Any reasonable use that an applicant has identified for the invention that 
can be viewed as providing a public benefit should be accepted as a substantial 
utility. See U.S. Pat. & Trademark Off., Manual of Patent Examining Procedure § 
2107.01. Proof of an asserted utility "beyond a reasonable doubt" or as a 
statistical certainty is not required; instead, evidence will be sufficient if, 
considered as a whole, it leads one skilled in the art to conclude that the asserted 
utility is more likely than not true. See U.S. Pat. & Trademark Off., Manual of 
Patent Examining Procedure § 2107.02 VII. The character and amount of 
evidence needed may vary depending on what is claimed. See In re Jolles, 628 
F.2d 1322, 1325 (C.C.P.A. 1980) (citing In re Chilowsky 229 F.2d 457, 461 (C.C.P.A. 
1956)). Nonetheless, there is "no basis in the statutes or decisions for requiring 
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any more conclusive evidence of operativeness in one type of case than another." 
In re Chilowsky, 229 F.2d at 461-2. 

Proof of utility is conclusively established as a matter of law if an 
infringing party has made, sold, or used the claimed invention. See Raytheon Co. 
v. Roper, 724 F.2d 951, 959 (Fed. Cir. 1983). The Federal Circuit has characterized 
this as a "common sense" rule because "[p]eople rarely, if ever, appropriate 
useless inventions." Raytheon Co. 724 F. 2d at 959. Similarly, proof of utility is 
supported when a claimed invention has been met with commercial success. See 
Raytheon Co. v. Roper Corp. 724 F. 2d at 959; see also In re Fisher, 421 F.3d at 22. 
This rule is a logical corollary of the Supreme Court's instruction in Brenner that 
the patent system is "related to the world of commerce rather than to the realm 
of philosophy." Brenner, 383 U.S. at 536 (quoting In re Ruschig, 343 F.2d 965, 970 
(C.C.P.A. 1965)). In other words, just as "[p]eople rarely, if ever, appropriate 
useless inventions," Raytheon Co. 724 F. 2d at 959 (emphasis added), people 
rarely, if ever, purchase useless inventions. 

(b) Enablement Under 35 U.S.C. § 112, first paragraph 
If an invention lacks utility under 35 U.S.C. § 101, then the patent 
application also fails to enable one of ordinary skill in the art to use the invention 
as required by 35 U.S.C. § 112, first paragraph. See In re Kirk, 376 F.2d 936, 942 
(C.C.P.A. 1976) ("Necessarily, compliance with § 112 requires a description of 
how to use presently useful inventions, otherwise an applicant would 
anomalously be required to teach how to use a useless invention."); see also U.S. 
Pat. & Trademark Off., Manual of Patent Examining Procedure § 2107.01. 
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2. In re Fisher 

The Federal Circuit's recent utility analysis in In re Fisher, 421 F.3d 1365 
(Fed. Cir. 2005) is particularly relevant to the instant patent application. In In re 
Fisher, the Appellant filed a patent application which disclosed more than 30,000 
short nucleotide sequences, termed "expressed sequence tags" (ESTs) from Zea 
mays (maize). Ex parte Fisher, App. No. 2002-2046, slip op. at 1 (Bd. Pat. App. Int. 
Mar. 16, 2004). ESTs correspond to fragments of expressed genes. ESTs are 
routinely and rapidly generated by machine-automated sequencing of cDNA 
clones at a rate of many hundreds of ESTs per day. At the time of writing, there 
are almost six hundred thousand publicly-described maize ESTs, and almost six 
million publicly-described human ESTs. See National Center for Biotechnology 
Information (NCBI) dbEST Database (<<http://www.ncbi.nlm.nih.gov/dbEST/ 
dbEST_summary.html»). 

Upon final rejection of claims drawn to five elected ESTs for lack of a 
specific and substantial utility and for lack of enablement, Fisher appealed to the 
Board arguing that ESTs have a number of specific uses, including 1) use for the 
identification of polymorphisms; and 2) use as probes or primers for the 
identification of orthologous genes from other plants and organisms. Ex parte 
Fisher, App. No. 2002-2046, slip op. at 1. 

The Board affirmed the Examiner's rejection, finding that the asserted uses 
were insubstantial because the function of the genes from which the ESTs were 
derived was unknown. Ex parte Fisher, App. No. 2002-2046, slip op. at 15. 
Specifically, the Board found that the asserted use of identifying polymorphisms 
was insubstantial because without knowing the function of the gene, detection of 
a polymorphism provides only "the barest information." Id. Similarly, using the 
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ESTs from genes having no known function to isolate other genes having no 
known function was insubstantial. See id. 

The Federal Circuit affirmed. The court held first that to constitute a 
substantial utility, an asserted use "must show that the claimed invention has a 
significant and presently available benefit to the public." In re Fisher, 421 F.3d at 
1371. In this case, the Federal Circuit held that "our precedent dictates that the 
'643 application does not meet the utility requirement of § 101 because Fisher 
does not identify the function for the underlying protein-encoding genes/' In re 
Fisher, 421 F.3d at 1376 (emphasis added). The court emphasized that without 
the identification of the function of their cognate genes, the claimed invention 
has "not been researched and understood to the point of providing an 
immediate, well-defined, real-world benefit to the public meriting the grant of a 
patent." Id. Instead, the claimed ESTs were merely "objects upon which 
scientific research could be performed with no assurance that anything useful 
will be discovered in the end." In re Fisher, 421 F.3d at 1373 (quoting Brenner, 383 
U.S. at 535). 

The court focused on the fact that the Appellant provided no evidence 
whatsoever— such as test data or declarations— to prove that any of the claimed 
ESTs had successfully been put to the alleged uses in the real world. This 
evidentiary failure left the court with "no choice but to conclude that the claimed 
ESTs do not have a 'substantial' utility under § 101." In re Fisher, 421 F.3d at 
1374. 

The court also held that to constitute a specific utility, an asserted use 
"must also show that the claimed invention can be used to provide a well- 
defined and particular benefit to the public." In re Fisher, 421 F.3d at 1371. In this 
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case, the asserted uses were not specific, the court found, because any EST could 
be used for the asserted uses. See In re Fisher, 421 F.3d at 1374. 

The court agreed that commercial success may support the utility of an 
invention. See In re Fisher, 421 F.3d at 1377-78 (citing Raytheon Co. v. Roper Corp., 
724 F.2d 951, 959 (Fed. Cir. 1983)). In this case, however, there was no evidence 
that any company had even expressed interest in, let alone purchased, a single 
one of the claimed ESTs. See In re Fisher, 421 F.3d at 1378. 

Finally, the court addressed concerns expressed by the government that 
allowing EST patents without proof of utility would discourage and delay 
research. The court warned that such public policy concerns must not be 
considered by courts in determining whether the utility requirement is met; such 
concerns should be addressed only by Congress. See In re Fisher, 421 F.3d at 1378. 

C. The Subject Matter Of Each of Claims 28-32, 37, 47, 53-57 Has 
Multiple Credible and Well-Established Uses 

The following arguments show that the presently-claimed invention has 
multiple credible and well-established utilities, each of which is substantial and 
specific. As a claimed invention need only be partially successful in achieving a 
single credibly asserted or well-established utility in order to pass muster under 
35 U.S.C. § 101, see Brooktree Corp., 97 F.2d at 1571, the claimed invention more 
than adequately satisfies the utility requirement. 

Note that in the discussion that follows, certain uses are described with 
reference to the homozygous mouse (a mouse that has two null alleles of the 
PTP36 gene) and tissue and cells isolated from the homozygous mouse. 
Homozygous mice, also referred to herein as "PTP36 knockout mice/ 7 are within 
the scope of claims 28-32 and 54-57; isolated tissues and cells from homozygous 



15 



Express Mail Label No. EV780362226US 
Date of Deposit: November 16, 2005 

mice are within the scope of claim 37; and the inherent phenotypes exhibited by 
homozygous mice are recited in claims 29-32. Since the homozygous mouse has 
utility, the heterozygous mouse (a mouse having only one null allele of the 
PTP36 gene) necessarily has utility because heterozygous mice must be used to 
generate the homozygous mice. See Section VILA supra. Heterozygous mice are 
within the scope of claims 28, 53, and 55-57. Heterozygous mice, and tissues and 
cells isolated from heterozygous mice (within the scope of claim 37), are also 
used to determine the gene expression pattern of PTP36. See Section VII.C.5 
below. Similarly, because the homozygous mice and the heterozygous mice have 
utility, the claimed method for generating homozygous and heterozygous mice 
(claim 47) also has utility. 

1. The PTP36 Knockout Mouse Is Used to Study The Function 
Of The PTP36 Gene. 

(a) The Asserted Utility Of Using The PTP36 Knockout Mouse 
To Study The Function Of The PTP36 Gene Is A Well- 
Established Utility. 

Contary to the Examiner's assertions, the present invention has a well- 
established utility since a person of ordinary skill in the art "would immediately 
appreciate why" (M.P.E.P. § 2107.02 II.B) the claimed PTP36 knockout mice are 
useful. Specifically, knockout mice have the inherent and well-established utility 
of defining the function and role of the disrupted target gene. 

The sequencing of the human genome has produced thousands of genes 
whose function has yet to be determined. Knockout mice are the recognized 
"gold standard" for determining the function of those genes. According to the 
United States National Institutes of Health (NIH), knockout mice are critical to 
studying gene function: 
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Over the past century, the mouse has developed into the premier 
mammalian model system for genetic research. Scientists from a wide 
range of biomedical fields have gravitated to the mouse because of its 
close genetic and physiological similarities to humans, as well as the 
ease with which its genome can be manipulated and analyzed. 

In recent decades, researchers have utilized an array of innovative 
genetic technologies to produce custom-made mouse models for a wide 
array of specific diseases, as well as to study the function of targeted 
genes... .[P]owerful approaches, dependent on homologous 
recombination, have permitted the development of tools to "knock out ,f 
genes, which involves replacing existing genes with altered versions... 
To preserve these extremely valuable strains of mice and to assist in the 
propagation of strains with poor reproduction, researchers have taken 
advantage of state-of-the-art reproductive technologies, including 
cryopreservation of embryos, in vitro fertilization and ovary 
transplantation. 

(«http://www.genome.gov/pfv.cfm?pageid=10005834»)(emphasis 
addedKExhibit A)(see Amendment filed February 17, 2005). Thus, the 
knockout mouse has been accepted by the NIH as the premier model for 
determining and studying gene function. 

Knockout mice are so well accepted as useful for determining gene 
function that the director of the NIH Chemical Genomics Center of the National 
Human Genome Research Institute (among others, including Capecchi, Bradley, 
Joyner, Nagy and Skarnes, each a pioneer and recognized expert in the field of 
mammalian genomics) has proposed creating knockout mice for all mouse 
genes: 

Now that the human and mouse genome sequences are known, 
attention has turned to elucidating gene function and identifying gene 
products that might have therapeutic value. The laboratory mouse 
(Mus musculus) has had a prominent role in the study of human disease 
mechanisms throughout the rich, 100-year history of classical mouse 
genetics, exemplified by the lessons learned from naturally occurring 
mutants such as agouti, reeler and obese.... Among the model 
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organisms, the mouse offers particular advantages for the study of 
human biology and disease: (i) the mouse is a mammal, and its 
development, body plan, physiology, behavior and diseases have much 
in common with those of humans; (ii) almost all (99%) mouse genes 
have homologs in humans; and (iii) the mouse genome supports 
targeted mutagenesis in specific genes by homologous recombination in 
embryonic stem (ES) cells, allowing genes to be altered efficiently and 
precisely. 

A coordinated project to systematically knock out all mouse genes is 
likely to be of enormous benefit to the research community, given the 
demonstrated power of knockout mice to elucidate gene function, the 
frequency of unpredicted phenotypes in knockout mice, the potential 
economies of scale in an organized and carefully planned project, and 
the high cost and lack of availability of knockout mice being made in 
current efforts. 

(Austin et al., Nature Genetics (2004) 36(9):921-24, 921)(emphasis 
addedKExhibit B) (see Amendment filed February 17, 2005). 

By way of further example, according to the Molecular Biology of the Cell 
(Albert, 4 th ed., Garland Science (2002)) (Exhibit C) (see Amendment filed 
February 17, 2005), one of the leading textbooks in the field of molecular biology: 

Extensive collaborative efforts are underway to generate 
comprehensive libraries of mutation in several model organisms 
including . . . the mouse. The ultimate goal in each case is to produce a 
collection of mutant strains in which every gene in the organism has 
either been systematically deleted, or altered such that it can be 
conditionally disrupted. Collections of this type will provide an 
invaluable tool for investigating gene function on a genomic scale. 

(p. 543)(emphasis added). 

According to Genes VII (Lewin, Oxford University Press (2000)) (Exhibit 
D) (see Amendment filed February 17, 2005), another well respected textbook in 
the field of genetics: 
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The converse of the introduction of new genes is the ability to disrupt 
specific endogenous genes. Additional DNA can be introduced within 
a gene to prevent its expression and to generate a null allele. Breeding 
from an animal with a null allele can generate a homozygous 
"knockout", which has no active copy of the gene. This is a powerful 
method to investigate directly the importance and function of the gene. 

(p. 508)(emphasis added). 

According to Joyner (Gene Targeting: A Practical Approach, preface, 
Oxford University Press 2000) (Exhibit E) (see Amendment filed February 17, 
2005) "[g]ene targeting in ES cells offers a powerful approach to study gene 
function in a mammalian organism/ 7 

According to Matise et al (Production of Targeted Embryonic Stem Cell 
Clones in Joyner, Gene Targeting: A Practical Approach, Oxford University 
Press 2000)(Exhibit F) (Amendment filed February 17, 2005): 

The discovery that cloned DNA introduced into tissue culture cells can 
undergo homologous recombination at specific chromosomal loci has 
revolutionized our ability to study gene function in cell culture and in 
vivo. . . . Thus, applying gene targeting technology to ES cells in culture 
affords researchers the opportunity to modify endogenous genes and 
study their function in vivo. 

(p. 101)(emphasis added). 

According to Crawley (What's Wrong With My Mouse Behavioral 
Phetto typing of Transgenic and Knockout Mice, Wiley-Liss 2000) (Exhibit G) 
(see Amendment filed February 17, 2005): 

Targeted gene mutation in mice represents a new technology that is 
revolutionizing biomedical research. 
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Transgenic and knockout mutations provide an important means for 
understanding gene function, as well as for developing therapies for 
genetic diseases. 

(p. 1, rear cover). 

According to Doetschman (Laboratory Animal Science 49:137-143, 137 
(1999)(Exhibit H)(emphasis addedMsee Amendment filed February 17, 2005): 

[t]he conclusions will be that the knockout phenotypes do, in fact, 
provide accurate information concerning gene function, that we should 
let the unexpected phenotypes lead us to the specific cell, tissue, organ 
culture, and whole animal experiments that are relevant to the function 
of the genes in question, and that the absence of phenotype indicates 
that we have not discovered where or how to look for a phenotype. 

Statements in the prior art, such as those provided above, may be used to 
establish that a person of ordinary skill in the art would immediately appreciate 
the asserted utility. See In re Brana, 51 F.3d at 1567 ("The prior art further 
supports the conclusion that one skilled in the art would be convinced of the 
asserted utility.' 7 ) Accordingly, Appellant submits that in light of arguments of 
record a person of ordinary skill in the art would immediately appreciate one 
reason why the invention is useful: for studying the function of the PTP36 gene. 

In addition, Appellant notes that the National Institutes of Health (N.I.H.) 
has recently announced a three-year contract with the assignee of the instant 
application, Deltagen, Inc., to procure knockout mice. The Director of the N.I.H., 
Elias Zerhouni, M.D. has stated that the "decision to procure these knockout 
mouse lines and data and make them available to the research community will 
yield tremendous benefits, both in the short and long terms." See "Researchers 
to Gain Wider Access to Knockout Mice," available at «http://www.genome. 
gov/17015131» (Exhibit I). Thus, the NIH clearly believes that knockout mice 



20 



Express Mail Label No. EV780362226US 
Date of Deposit: November 16, 2005 

are useful, specifically including knockout mice generated by the assignee of the 
instant application. 

(b) The Asserted Utility Of Using The PTP36 Knockout Mouse 
To Study The Function Of The PTP36 Gene Is Credible. 

A "credible" assertion of utility is one that is 

[B]elievable to a person of ordinary skill in the art based on the totality 
of evidence and reasoning provided. An assertion is credible unless (A) 
the logic underlying the assertion is seriously flawed, or (B) the facts 
upon which the assertion is based are inconsistent with the logic 
underlying the assertion. 

(MPEP 2107.02, III(B)(emphasis added). 

In light of the arguments made above which show that the knockout 
mouse is the premier tool known in the art for determining gene function, the 
asserted utility of using the claimed mouse to study the function of the PTP36 
gene would clearly be believable to a person of ordinary skill in the art. 
Therefore, the asserted use is credible. 

(c) The Asserted Utility Of Using The PTP36 Knockout Mouse 
To Study The Function Of The PTP36 Gene Is A Specific 
Utility 

Use of the PTP36 knockout mouse to study the function of the PTP36 gene 
is specific to this mouse. Only the PTP36 gene has been disrupted in the PTP36 
knockout mouse. The ability to target specific genes using mouse ES cells is 
described below: 

One remarkable property of many mammalian cell lines, including ES 
cells, is the ability of transfected DNA to locate and recombine with its 
homologous chromosomal counterpart in the genome. This process, 
commonly referred to as gene targeting, is being used in many 
laboratories to generate ES cells and ultimately mouse lines in which 
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specific loci have been modified. A common rationale to disrupt gene 
function by gene targeting in ES cells is to construct a vector which is 
designed to undergo a homologous genetic exchange with its 
chromosomal counterpart. These vectors are typically arranged so that 
they insert additional sequences, such as a positive selection marker, 
into the coding elements of the target, thereby ablating the function of 
the target gene. 

Since gene targeting techniques are designed to generate specific 
recombinant alleles, features of the desired recombinant locus have 
been used to select in vitro for targeted clones and against random 
integration events, thereby enhancing the representation of targeted 
clones in selected populations. 

(Bradley et ah, Biotechnology 10:534-538, 535-6 (1992) (cited by Examiner, 
paper no. 8)(emphasis added). Since each knockout mouse is specifically 
designed to disrupt a particular gene, the use of each mouse is specific for 
studying the function of that disrupted gene. Thus, only a PTP36 knockout 
mouse (as opposed to all other knockout mice) would be used to study the 
function of the PTP36 gene. 

(d) Actual Use Of The PTP36 Knockout Mouse To Study Gene 
Function Constitutes A Real World Use 

Determining the function of the PTP36 gene constitutes a real world use as 
demonstrated by Assignee Deltagen's phenotypic analyses of the claimed PTP36 
knockout mouse and comparison with wild- type control mice. The claimed mice 
were actually generated. The claimed mice were successfully used in the 
asserted manner— tor determining the function of the PTP36 gene— by studying 
phenotypic changes in comparison with wild-type control mice. The claimed 
mice were subjected to an exhaustive battery of phenotypic screens including: 

• physical examinations; 

• necropsy, including body length, body weight, and organ weight 
measurements; 
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• histological examination of tissues and organs; 

• bone marrow section evaluations; 

• complete blood counts and differentials; 

• clinical chemistry panels; and 

• behavioral tests. 

See Specification, page 48, lines 1-9. 

Any statistically significant changes between the claimed mice and wild- 
type control mice are the result of the disruption of the PTP36 gene. See 
specification, page 18, line 21 through page 19, line 7. According to the 
specification (page 49, lines 14-23) (emphasis added): 

The homozygous female mice did not have demonstrable mammary 
gland tissue or had only a few mammary ducts on deeper levels. The 
homozygous female mice also contained the presence of keratin in the 
uterine horns; more specifically, keratinous debris was detected, but 
without concomitant squamous metaplasia, suggestive of passage of 
keratin from the vagina through a dilated cervix, indicating cervical 
relaxation. Increased anogenital distance in homozygous mutant 
females was also detected. The combination of lack of mammary 
growth and cervical relaxation are suggestive of a hormonal imbalance. 
Hormones having an effect on reproductive and/ or mammary tissues 
during development include estrogens, progesterones, growth 
hormone, thyroxine and insulin. The phenotypes observed are 
consistent with androgenization of the mutant female mice. 

Thus, by generating and studying the claimed mice, the Appellant has 
identified the function of the underlying protein encoding PTP36 gene, 
specifically the role of the PTP36 gene in the generation, inter alia, of mammary 
gland tissue. This stands in sharp contrast to the situation in In re Fisher, where 
the Federal Circuit's finding of lack of utility was based on the total failure of the 
appellant to show that the claimed ESTs had been successfully used in any of the 
asserted ways to determine the function of the underlying protein-encoding 
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gene. See In re Fisher, 421 F.3d at 1376 (holding that ''precedent dictates that the 
[patent application] does not meet the utility requirement of § 101 because Fisher 
does not identify the function for the underlying protein-encoding genes/') In 
other words, the Appellant has studied the claimed invention "to the point of 
providing an immediate, well-defined, real-world benefit to the public meriting 
the grant of a patent." Id. Unlike the invention in In re Fisher, the claimed 
invention is not merely an "object upon which scientific research could be 
performed with no assurance that anything useful will be discovered in the end." 
In re Fisher, 421 F.3d at 1374. To the contrary: the claimed invention has provided 
the useful end result of defining the function of the underlying PTP36 gene. As 
one skilled in the art would immediately appreciate, ascribing functions to genes 
is the end goal of genomics research. 

The Examiner has argued that using the claimed mouse to study the 
function of the PTP36 gene constitutes "further research," and that such "further 
research" is not a substantial utility. See Office Action mailed March 22, 2005, 
page 8. The Examiner characterizes the utility of using the claimed mouse to 
study the function of the PTP36 gene as a "scientific utility" rather than a 
"patentable utility." See id. In support of this proposition, the Examiner cites 
U.S. Pat. & Trademark Off., Manual of Patent Examining Procedure § 2107.01.1: 

Utilities that require or constitute carrying out further research to 
identify or reasonably confirm a "real world" context of use are not 
substantial utilities. ...[T]he following are examples of situations that 
require or constitute carrying out further research to identify or 
reasonably confirm a "real world" context of use and, therefore, do not 
define "substantial utilities": 

(A) Basic research such as studying the properties of the claimed 
product itself or the mechanisms in which the material is involved; 
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The language cited by the Examiner establishes that an invention has no 
substantial utility when "further research" is required to identify or confirm any 
utility. In other words, the cited language deals only with the situation where 
there is no known utility for a claimed invention, and where the only asserted 
use is studying the invention to determine a utility. This is clearly not the 
situation presented by the claimed PTP36 knockout mice. The Appellant has 
actually studied the claimed mouse and has discovered that it has observable 
phenotypes, such as a lack of mammary gland tissue. Thus, the Appellant has 
used the claimed invention in the manner asserted and has discovered that the 
PTP36 gene is involved in the generation of mammary gland tissue. 
Accordingly, the Appellant has established that the claimed mice can be used to 
study the role of the PTP36 gene in the development of mammary gland tissue. 

To be sure, further research on the mouse will be required to refine the 
precise details of the molecular pathway through which PTP36 participates in the 
development of mammary gland tissue; however, the expectation of such further 
research is not inimical to a finding of utility. See In re Brana, 51 F.3d at 1568 
("Usefulness in patent law... necessarily includes the expectation of further 
research and development"). No further research is required to identify a utility 
for the PTP36 knockout mouse or to confirm that the PTP36 knockout mouse is 
useful for studying the role of PTP36 in the generation of mammary gland tissue. 
Put simply, there is a world of difference between using the mouse to determine 
any utility and using the mouse to determine the role of the PTP36 gene in, for 
example, the generation of mammary gland tissue. 

The Examiner has argued that none of the described phenotypes 
"correlate with any known disease (either mouse or human)." Office Action, 
mailed March 22, 2005, page 8. As a result, the Examiner contends, "it is not 
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apparent to one of ordinary skilled [sic] in the art to immediately appreciate how 
to use the claimed mouse base [sic] on such characteristics/' Id. As described in 
Section VILE below, the phenotypes described are characteristic of 
hyperandrogenism (an excess of androgens) in females. Accordingly, the 
described phenotypes do correspond to a known disease in humans. In any 
event, it is entirely unnecessary for the described phenotypes to correlate to a 
known disease in order for the claimed knockout mouse to be useful in 
determining the function of the PTP36 gene. Useful information on the function 
of a gene can be obtained solely by studying phenotype(s), irrespective of the 
correlation of those phenotypes to a known disease. While the correlation of 
phenotypes to a disease provides additional useful information regarding the 
function of the gene, it is not a prerequisite— neither in the law nor from a 
practical, scientific viewpoint— for obtaining any useful functional information. 

The Examiner has also argued that, in general, "knockout mice may not be 
capable of elucidating the function of the protein and may provide a clue to [the] 
pathway the protein being knocked out is involved in." Office Action mailed 
March 22, 2005, page 10 (emphasis omitted). In support of this proposition, the 
Examiner cites R.W. Olsen & G.E. Homanics, Function ofGABA-A Receptors; 
Insights From Mutant and Knockout Mice, in GABA in the Nervous System: The 
View at 50 Years 81-86 (D.L. Martin, and R.W. Olsen, eds., 200) as stating: 

Although gene targeting is often useful in delineating the contribution 
of a given gene product to phenotypic characteristics observed, some 
gene knockouts lead to embryonic or perinatal lethality, and others 
lead to no apparent phenotype. This can arise from the lack of any role 
for the gene in question in regard to the trait studies or from 
compensation by other gene products. Analysis of the compensation 
can yield valuable clues to the genetic pathway. 

Id. at 82 (emphasis added). 
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The quote from Olsen & Homanics concerns only cases where there is 
early lethality or where there is no visible phenotype, i.e., where the knockout 
mice appear identical to wild-type mice. This is clearly not the case for the 
claimed PTP36 knockout mice. Unlike the examples discussed by Olsen & 
Homanics, the claimed PTP36 knockout mice have phenotypes and do not 
exhibit perinatal or embryonic lethality. Moreover, Olsen & Homanics state that 
"gene targeting is often useful in delineating the contribution of a given gene 
product to phenotypic characteristics observed/' Id. Accordingly, Olsen & 
Homanics actually confirm that the claimed PTP36 knockout mice have a 
substantial utility. 

(e) Actual Use Of The Gene Function Data Derived From The 
Use Of The PTP36 Knockout Mouse Constitutes A Further 
Real World Use. 

The claimed mouse has been extensively analyzed using the tests set forth 
in the Examples. Such tests reveal the function of the PTP36 gene in, inter alia, 
the generation of mammary gland tissue. This data has been incorporated into 
Appellant Deltagen's commercial database product, DeltaBase. Accordingly, the 
application provides: 

In one embodiment, the phenotype (or phenotypic change) associated 
with a disruption in the PTP36 gene is placed into or stored in a 
database. Preferably, the database includes: (i) genotypic data (e.g., 
identification of the disrupted gene) and (ii) phenotypic data (e.g., 
phenotype(s) resulting from the gene disruption) associated with the 
genotypic data. The database is preferably electronic. In addition, the 
database is preferably combined with a search tool so that the database 
is searchable. 

Specification, page 14, lines 11-16. DeltaBase is the world's most 
comprehensive commercial library of in-depth functional information regarding 
in vivo mammalian gene function and their relevancy to small molecule drug 
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discovery. DeltaBase contains phenotypic information on approximately seven 
hundred and fifty target genes, including PTP36. DeltaBase offers the ability to 
not only further study the function of the PTP36 gene, but also provides a 
consistent analysis platform by allowing comparisons between data sets for all 
the gene knockouts contained in the database. See "Deltagen and Merck Enter 
Into DeltaBase License Agreement/' available at <<http://www.sec.gov/Archives/ 
edgar/data/1034072/000089843002000648/dex992.htm»(ExhibitJ) 

The commercial success of the database is demonstrated by subscriptions 
to DeltaBase by at least three of the world's largest pharmaceutical companies, 
Merck, Pfizer and GSK. See Amendment filed June 22, 2005, page 7; see also 
Declaration of Robert Driscoll, filed June 22, 2005 (Exhibit K). To date, the sum 
of the subscription and related fees from just these three subscribers alone 
amounts to more than sixty million dollars ($60,000,000.00). This evidence of 
commercial success more than satisfies the practical utility requirement of 35 
U.S.C. § 101 because "[p]roof of . . . utility is further supported when, as here, the 
inventions set forth in [the] claims . . . have on their merits been met with 
commercial success/' Raytheon Co. at 959. 

By creating a knockout mouse with a disruption in the PTP36 gene, and 
incorporating the data into a commercial product, and selling that data to 
pharmaceutical companies, Appellant has offered an "an immediate, well- 
defined, real- world benefit to the public/' In re Fisher, 421 F.3d 1376. Unlike the 
Appellant in In re Fisher, Appellant has actually used the invention in the real 
world. 
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(f) Commercial Sale Of The PTP36 Knockout Mouse 
Constitutes A Further Real World Use. 

The Examiner has argued that "[commercial success is only considered as 
secondary evidence for overcoming a [Section] 103(a) rejection/' Advisory 
Action mailed July 14, 2005. This statement of the law is incorrect. The Federal 
Circuit has clearly indicated that "commercial success may support the utility of 
an invention/ 7 In re Fisher, 421 F.3d at 1377-78 (citing Raytheon Co. 724 F.2d at 959 
("Proof of such utility is further supported when, as here, the inventions . . . have 
on their merits been met with commercial success")). 

At least one pharmaceutical company has licensed the presently claimed 
mouse. See Amendment dated June 22, 2005, pages 7-8; see also Declaration of 
Robert Driscoll filed June 22, 2005 (Exhibit K). The terms of the license 
agreement prevent the Appellant from revealing the name of the customer; 
however, the customer ranks among the top 10 pharmaceutical companies 
worldwide (based on sales). See Declaration of Robert Driscoll. The knockout 
mice provided by Deltagen are being used to study the function of the targeted 
gene and for human therapeutic drug development. See Driscoll Declaration, 
page 2, para. no. 4. According to the Federal Circuit in In re Fisher: 

[The Appellant] did not present any evidence showing that agricultural 
companies have purchased or even expressed any interest in the 
claimed ESTs. And, it is entirely unclear from the record whether such 
business entities ever will. Accordingly, while commercial success may 
support the utility of an invention, it does not do so in this case. See 
Raytheon Co. v. Roper, 724 F.2d 951, 959 (Fed. Cir. 1983)(stating that 
proof of a utility may be supported when a claimed invention meets 
with commercial success). 

In re Fisher, 421 F.3d at 1377-78. In this case, Appellant has submitted 
evidence that the claimed invention has been purchased and delivered to at least 
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one large pharmaceutical company. Unlike the Appellant in In re Fisher, 
Appellant has presented evidence that the PTP36 knockout mouse has actually 
been used in the real world. Just as "[p]eople rarely, if ever, appropriate useless 
inventions," Raytheon Co. 724 F. 2d at 959, it is abundantly clear that large 
pharmaceutical companies rarely, if ever, purchase useless inventions. Thus, the 
delivery of the claimed PTP36 knockout mouse more than satisfies the practical 
utility requirement of Section 101 as it cannot be reasonably argued that a 
claimed invention which has been actually purchased and delivered to those 
skilled in the art has no "real world" use. 

The Examiner argues commercial success is only established by 
"worldwide use" and "high revenue" from sales. See Advisory Action mailed 
July 14, 2005. The Examiner further argues that the sale of the claimed PTP36 
knockout mouse to one pharmaceutical company does not amount to 
commercial success. See id. Appellant submits that commercial success must be 
determined on a market-by-market basis, rather than using an arbitrary yardstick 
such as "worldwide use." The market for PTP36 knockout mice— which, as 
described above, are used for validating drug targets— is comprised of only those 
few large pharmaceutical companies who have the resources to develop new 
pharmaceutical products. As is well known in the art, the cost of developing a 
new pharmaceutical is simply staggering: one recent economic study found that 
the average cost of research and development for a new drug is $802 million. See 
DiMiasi et al, Journal of Health Economics 22 (2003) 151-185 (Exhibit L). 
Therefore, the sale of the claimed PTP36 knockout mouse to at least one of the 
ten largest pharmaceutical companies in the world constitutes "commercial 
success." 
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2. The PTP36 Knockout Mice Have Further Well-Established 
Patentable Utility As They Are Useful For Validating The 
PTP36 Gene As A "Draggable" Target 

PTP36 is a member of the protein tyrosine phosphatase family. 
Phosphatases are currently under intense scrutiny as targets for the development 
of drugs. See Huijsduijnen et aL, Expert Opin. Ther. Targets, 2002, 6:624 (see 
Exhibit M). In one recent report, it was estimated that 4% of the -3,000 genes in 
the "druggable genome" are protein phosphatases (the "druggable genome" is a 
subset of the human genome, and is comprised of those genes whose products 
are likely able to bind to drug-like molecules and are, thus, pharmacologically 
accessible targets). In other words, there are about 100 protein phosphatases that 
are capable of serving as drug targets. See Hopkins & Groom, Nature Reviews 
Drug Discovery 1:727 (2002) (Exhibit N). 

The process of determining whether a given gene in the druggable 
genome is a worthwhile target for the development of a drug is termed "target 
validation." At the time of filing, and to this day, the "gold standard" for target 
validation is the knockout mouse. A knockout mouse mimicks the effect of 
treating a wild-type mouse with a drug that specifically inhibits the function of 
the target gene. By determining the phenotypic consequences of the disruption 
of the target gene, one can see the likely effect— and side effects— of such a drug 
without first incurring the massive expense of developing and testing a 
candidate drug. A recent article illustrates that the use of mouse knockouts in 
drug target validation was a well-established — and highly successful— utility at 
the time of filing: 

After a decade of using mouse knockouts, the data on their predictive 
power in drug discovery is irrefutable. The top 100 selling drugs in 
2001 are directed only to 29 drug targets, many with multiple agents 
addressing the same target. Of these 29 targets, 23 have been knocked 
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out and in every case the knockout mouse was highly predictive as to 
the on-target effects and side effects of the associated drugs. 

See Arthur T. Sands, Industrializing Breakthrough Discovery, Current 
Drug Discovery, Aug. 2002, at 21.) (emphasis added) (Exhibit O). 

Another phosphatase, PTP1B, was validated as a target using this 
approach. Specifically, since mice with disruptions in PTP1B are hyper-sensitive 
to insulin and resistant to diet-induced obesity, it was predicted that specific 
PTP1B inhibitors could be used to treat type 2 diabetes and obesity. See Elchebly 
et al., Science 283:1544 (1999) (Exhibit P). PTP1B inhibitors are currently the 
focus of considerable drug discovery efforts by major pharmaceutical companies. 

The claimed mice mimic the effects of treating a wild-type mouse with a 
drug that specifically inhibits the activity of the PTP36 phosphatase. Thus, the 
claimed mice have "a significant and presently available benefit to the public," In 
re Fisher, 421 F.3d 1371, because they actually have been made and studied and 
actually have provided the public with the useful knowledge that the effects of a 
PTP36 inhibitor on a female would likely be a hormonal imbalance leading to 
androgenization. This information is critically useful for pharmaceutical 
companies in determining whether they should direct their research resources 
towards developing inhibitors of PTP36. In addition, this information is useful 
to pharmaceutical companies because it reveals the side effects of phosphatase 
inhibitors that are not totally specific for a targeted phosphatase. For example, if 
a phosphatase inhibitor drug targets the PTP1B phosphatase (a major target in 
diabetes and obesity), but also has an inhibitory effect on PTP36 in vitro, then the 
side effects of treatment with this PTP1B inhibitor can be predicted using 
information provided by the claimed mouse. 
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A recent review article has independently confirmed the utility of the 
claimed mice in validating phosphatases, specifically including PTP36, as 
druggable targets. Specifically, in Huijsduijnen et al., Expert Opin. Ther. 
Targets, 2002, 6:624-643 (emphasis added) (Exhibit M), the authors state: 

Validating a PTP drug target is not a trivial task. Much more so than 
for other signalling components, predicting the effects of PTP inhibition 
from overexpression or substrate specificity studies alone has not been 
very successful. The history of PTPlB's discovery as a target clearly 
illustrates that purely deterministic approaches alone are not 
appropriate for PTPs.... An effective, systematic PTP target approach 
would instead focus on knockout animals and antisense or other 
inhibitors as tested in disease models. 

In their discussion of the utility of knockout mice in validating PTPs as 
targets, Huijsduijnen et al. specifically refer to PCT Publication No. WO 
02/45500— the PCT counterpart of the instant U.S. application— as teaching that 
PTP36 knockout female mice are androgenized with no mammary gland tissue. 
Thus, Huijsduijnen et al. independently confirm that the utility of using PTP36 
knockout mice to validate PTP36 as a druggable target is well-established, 
substantial, and specific. 



3. The Subject Matter Of Claims 28-32, 37, 47, 53-57 Has Further 
Patentable Utility As The PTP36 Knockout Mouse Is Useful 
As A Model For Studying Androgenization And For 
Identifying Therapeutic Agents For The Treatment Of 
Androgenization. 

As described in Section VII.A.4.(a) supra, the claimed mice manifest 
androgenization, also known in the art as "hyperandrogenism." 
Hyperandrogenism is usually driven by excessive androgen (i.e., male hormone) 
production by the ovaries, the adrenal glands, or both. It can result in hirsutism, 
acne, androgenic alopecia, oligo-ovulation, irregular menstrual bleeding, and 
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breast atrophy (reduction in mammary gland tissue). Hyperandrogenism may 
ultimately lead to life-threatening cardiovascular problems and metabolic 
disorders, such as type 2 diabetes mellitus. See American Association of 
Clinical Endocrinologists Medical Guidelines for Clinical Practice for the 
Diagnosis and Treatment of Hyperandrogenic Disorders, Endocrine Practice 
7:121 (2001) (Exhibit Q). Approximately 7% of reproductive-aged women 
experience hyperandrogenism. See Azziz et aL, J. Clin. Endocrinol. Metab. 
2004, 89(2):453-62 (Exhibit R). 

The claimed mouse therefore has a credible, specific, and substantial 
utility as a model of human disease, contrary to the Examiner's assertion. See 
Office Action mailed March 22, 2005 at page 5. Specifically, the claimed PTP36 
knockout mouse has utility as a model of hyperandrogenism. As such, the 
claimed mice are useful for studying the pathogenesis of hyperandrogenism. 
The claimed mice— and tissues and cells isolated from the claimed mice — also 
have a credible, specific, and substantial utility as they can be used in methods 
for identifying drugs that treat hyperandrogenism. Again, this stands contrary 
to the Examiner's assertion that the claimed mice have no utility in the screening 
for therapeutic agents. See Office Action mailed March 22, 2005 at page 4. 
Methods for identifying such drugs are described at page 17, line 1 to page 18, 
line 15 of the specification. 

4. The Subject Matter Of Claims 28-32, 37, 47, 53-57 Has Further 
Patentable Utility As The PTP36 Knockout Mouse Is Useful 
As A Model For Studying The Role Of PTP36 In Cancer Cell 
Metastasis 

At the time of filing, it was well known in the art that PTP36 plays a role 
in regulating cell adhesion (see the Background of the Invention; see also Section 
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VILC.4 supra). At the time of filing, it was also well known that cancer cells must 
disrupt the adhesive contacts they make with other cells in order to migrate from 
a primary tumor mass into the surrounding tissue i.e., to metastasize. For 
example, see Cooper, The Cell: A Molecular Approach, Chapter 15 (2000) 
(Exhibit S) (emphases added): 

Cancer cells are also less stringently regulated than normal cells by cell- 
cell and cell-matrix interactions. Most cancer cells are less adhesive 
than normal cells, often as a result of reduced expression of cell surface 
adhesion molecules. For example, loss of E-cadherin, the principal 
adhesion molecule of epithelial cells, is important in the development of 
carcinomas (epithelial cancers). As a result of reduced expression of cell 
adhesion molecules, cancer cells are comparatively unrestrained by 
interactions with other cells and tissue components, contributing to the 
ability of malignant cells to invade and metastasize. The reduced 
adhesiveness of cancer cells also results in morphological and 
cytoskeletal alterations: Many tumor cells are rounder than normal, in 
part because they are less firmly attached to either the extracellular 
matrix or neighboring cells. 



Because PTP36 was known to be involved in modulating cell adhesion, 
and because the modulation of cell adhesion was known to be involved in 
metastasis, one skilled in the art at the time of filing would immediately 
understand that the claimed PTP36 mice can be used to identify the role of the 
PTP36 gene in the growth and progression of tumors, specifically in the 
metastasis of cancer cells. Hence, the claimed mouse has a well-established, 
substantial, and specific utility in studying the role of PTP36 in cancer cell 
metastasis. 

After the filing date of the instant application, the role of PTP36 mutations 
in the development of cancer was confirmed. See Wang et al, 2004, Science 
304:1164 (Exhibit T) ; note that Wang et al refer to PTP36 using the synonymous 
name // PTPN14 ,, ). Specifically, it has been observed that inactivating mutations 
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in the PTP36 gene are observed in colorectal cancers. In addition, Wadham et al., 
Molecular Biology of the Cell 14:2520 (2003) (Exhibit U) have shown that 
inactivating mutations in PTP36 cause a decrease in cell-cell adhesion, with a 
concomitant increase in cell migration. These post-filing date references 
independently confirm the well-established, substantial, and specific utility of 
the claimed mice in the study of PTP36 in cancer cell metastasis. 



5. The Subject Matter Of Claim 57 Has Further Patentable 

Utility As The PTP36 Knockout Mouse Is Useful For Studying 
Expression Of The PTP36 Gene. 

The mouse of claim 57 further comprise a lacZ reporter gene. As 
explained in Section VILA supra, the lacZ gene was inserted into the loci of the 
PTP36 gene and is under the control of the PTP36 endogenous promoter. Thus, 
lacZ is expressed where PTP36 is normally expressed. Expression may be 
studied by staining tissues and cells isolated from heterozygous knockout mice 
with X-gal and then observing the blue staining which is indicative of (3- 
galactosidase expression. Expression analysis is therefore useful for analyzing 
where PTP36 is expressed. 

According to the specification: 

LacZ Reporter Gene Expression. In general, tissues from 7-12 week 
old heterozygous mutant mice were analyzed for lacZ expression. 
Organs from heterozygous mutant mice were frozen, sectioned (10 |^m), 
stained and analyzed for lacZ expression using X-Gal as a substrate for 
beta-galactosidase, followed by a Nuclear Fast Red counterstaining. 
In addition, for brain, wholemount staining was performed. The 
dissected brain was cut longitudinally, fixed and stained using X-Gal as 
the substrate for beta-galactosidase. The reaction was stopped by 
washing the brain in PBS and then fixed in PBS-buffered formaldehyde. 
Wild-type control tissues were also stained for lacZ expression to reveal 
any background or signals due to endogenous beta-galactosidase 
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activity. The following tissues can show staining in the wild-type 
control sections and are therefore not suitable for X-gal staining: small 
and large intestines, stomach, vas deferens and epididymis. It has been 
previously reported that these organs contain high levels of 
endogenous beta-galactosidase activity. 

Striking lacZ (beta-galactosidase) expression was observed in 
blood vessels and adipocytes. LacZ expression was detectable in brain, 
eye, sciatic nerve, Harderian glands, thymus, spleen, aorta, heart, lung, 
liver, gall bladder, pancreas, kidney, urinary bladder, trachea, larynx, 
esophagus, thyroid gland, adrenal glands, salivary glands, tongue, 
skeletal muscle, skin, male and female reproductive systems. 



Specification, page 50, lines 10-26. Using the claimed mice to study gene 
expression is specific since the lacZ gene is specifically inserted into the PTP36 
gene, thereby falling under the control of the PTP36 gene promoter and thus 
revealing where the PTP36 gene is expressed. 

Use of the claimed mice to study gene expression constitutes a substantial 
use as it represents a real world use. The use of heterozygous mice containing a 
reporter gene in studying gene expression is clearly recognized by those skilled 
in the art as a real world use: 

Null-reporter alleles should be created 

The project should generate alleles that are as uniform as possible, to 
allow efficient production and comparison of mouse phenotypes. The 
alleles should achieve a balance of utility, flexibility, throughput and 
cost. A null allele is an indispensable starting point for studying the 
function of every gene. Inserting a reporter gene (e.g., (i-galactosidase 
or green fluorescent protein) allows a rapid assessment of which cell 
types normally support the expression of that gene. 

Austin et al., Nature Genetics (2004) 36(9):921-24, 922) discussing the 
NIH's Knockout Mouse Project (emphasis in original; emphasis added)(Exhibit 
B) (see Amendment filed February 17, 2005). 
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Unlike the ESTs in In re Fisher, Appellant has actually used the invention 
in the asserted manner to generate useful information regarding the expression 
of the PTP36 gene. The claimed mice have provided "an immediate, well- 
defined, real-world benefit to the public/' In re Fisher, 421 F.3d at 1376, because 
they actually have revealed where the PTP36 gene is expressed. 

The Examiner argues that "studying the expression of a gene in which the 
function is unknown is not a substantial utility." Office Action dated March 22, 
2005 at page 9. As shown above, studying the expression of a gene is a real- 
world use that is immediately appreciated as such by those skilled in the art. 
Moreover, contrary to the Examiner's assertion, the function of the PTP36 gene is 
known. See Section VII.C.4 supra. Using the claimed mouse to study the 
expression of a gene that has a known function is, contrary to the Examiner's 
assertion, a well-established and substantial utility. It is also a specific utility 
because only the claimed mouse having a LacZ gene inserted in the PTP36 gene 
can be used to study the expression of the PTP36 gene; mice having a LacZ gene 
inserted into another gene could not be used in this regard. 

D. The Patent Application Enables One of Ordinary Skill in the Art 
to Use the Subject Matter of Claims 28-32, 37, 47, and 53-57 
Because the Subject Matter of Claims 28-32, 37, 47, and 53-57 Has 
a Specific and Substantial Utility 

The rejection of claims 28-32, 37, 47, and 53-57 under 35 U.S.C. § 112, first 
paragraph for lack of enablement is entirely contingent upon the rejection of 
claims 28-32, 37, 47, and 53-57 under 35 U.S.C. § 101 for lack of utility. 
Accordingly, because the claimed invention has a patentable utility, as 
demonstrated supra, the 35 U.S.C. § 112, first paragraph rejection should be 
withdrawn. 
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The Applicant also notes that the Examiner originally agreed that the 
invention was enabled. In the Office Action mailed June 3, 2003, the Examiner 
stated: 

The specification and the working examples provide sufficient 
guidance to use the invention of a female homozygous knockout mouse 
comprising a disruption [of] the PTP36 gene which result in no 
expression of the protein, wherein said mouse exhibits phenotypic 
features including androgenization, uterine abnormality, abnormal 
body or organ weight and abnormal physical feature as compared to 
wild type mice. 

Id. at page 5. 

Subsequently, the Examiner argued the specification failed to teach how 
to use "a PTP36 transgenic knockout mouse, wherein the homozygous knockout 
mouse exhibits phenotype of an uterine abnormality, hormonal imbalance, 
androgenization, increased body weight, increased organ weight, reduced or 
absent mammary tissue or increased anogenital distance/ 7 Office Action mailed 
April 27, 2004, page 3. 

Appellant respectfully submits that the Examiner was correct the first 
time: the specification clearly teaches one skilled in the art how to use the 
claimed invention. Since the "how to use" prong of Section 112, first paragraph 
is satisfied, the utility requirement of Section 101 is also satisfied. 

E. The Rejection of Claims 28-32, 37, 47, 53-54, and 57 under 35 

U.S.C. § 112, first paragraph (new matter), Is Improper Because 
The Specification Reasonably Conveys That The Applicant Was 
In Possession Of The Claimed Subject Matter At the Time The 
Application Was Filed. 

The Examiner rejected claims 28-32, 37, 47, 53-54, and 57 in the Office 
Action mailed March 22, 2005 because the specification allegedly does not 
disclose a knockout mouse comprising a null allele that comprises exogenous 
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DNA or a visible marker, as was recited in the claims pending on March 22, 2005. 
Without acquiescing in this rejection, the Appellant amended the claims pending 
in the Office Action mailed March 22, 2005 to remove the term "exogenous 
DNA" and to replace the term "visible marker" with "LacZ." See Supplemental 
Amendment mailed September 8, 2005. The Supplemental Amendment has been 
entered. Accordingly, it is believed that the rejection is overcome. 

F. The Rejection of Claim 32 under 35 U.S.C. § 112, second 
paragraph, is Improper 

In the Office Action of March 22, 2005, the Examiner advised the 
Appellant to add commas to the recitation "increased thymus weight increased 
liver weight relative to body weight and increased spleen weight relative to body 
weight" in claim 32. In the Supplemental Amendment mailed September 8, 2005, 
the Appellant made the requested changes. Accordingly, it is believed that the 
rejection is overcome. 
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VIII. CLAIMS APPENDIX. 

The following claims are appealed and are presented as a clean copy. 
Claims canceled or withdrawn during prosecution are not listed. 

28. A transgenic mouse whose genome comprises a null allele in the 
endogenous PTP36 gene. 

29. The transgenic mouse of claim 54, wherein the female mouse exhibits, 
relative to a wild-type control mouse, a uterine abnormality comprising uterine 
dilation. 

30. The transgenic mouse of claim 54, wherein the female mouse exhibits, 
relative to a wild-type control mouse, a uterine abnormality comprising keratin 
in the uterine horns. 

31. The transgenic mouse of claim 54, wherein the female mouse exhibits, 
relative to a wild-type control mouse, a uterine abnormality comprising keratin 
in the uterine lumen. 

32. The transgenic mouse of claim 54, wherein said mouse exhibits, relative to 
a wild-type control mouse, increased organ weight comprising at least one of the 
following: increased liver weight, increased spleen weight, increased thymus 
weight, increased liver weight relative to body weight, and increased spleen 
weight relative to body weight. 
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37. A cell or tissue isolated from the transgenic mouse of claim 28. 

47. A method of producing the transgenic mouse of claim 28, the method 
comprising: 

a. introducing a targeting construct capable of disrupting an 
endogenous PTP36 allele into a mouse embryonic stem cell; 

b. selecting for the mouse embryonic stem cell that has 
undergone homologous recombination; 

c introducing the mouse embryonic stem cell selected for in 
step (b) into a blastocyst; 

d. implanting the resulting blastocyst into a pseudopregnant 
mouse, wherein the resultant mouse gives birth to a chimeric 
mouse; and 

e. breeding the chimeric mouse to produce the transgenic 
mouse. 

53. The transgenic mouse of claim 28 wherein the mouse is heterozygous for 
said null allele. 

54. The transgenic mouse of claim 28 wherein the mouse is homozygous for 
said null allele. 
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55. The transgenic mouse of claim 28 wherein said null allele comprises a 
gene encoding a selection marker. 

56. The transgenic mouse of claim 55 wherein said gene is a neomycin 
resistance gene. 

57. The transgenic mouse of claim 56 wherein said null allele further 
comprises a lacZ gene. 
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EVIDENCE APPENDIX 

Enclosed, please find copies of the following references: 
Exhibit A 

Background on Mouse as a Model Organism 
<<http://www.genome.gov/pfv.cfm?pageid=10005834>> 

Exhibit B 

Austin et al., Nature Genetics (2004) 36(9):921-24, 921 
Exhibit C 

Alberts, Molecular Biology of the Cell, p.543 (4th Ed. Garland Science (2002)) 
Exhibit D 

Genes VII, p. 508 (Lewin, Oxford University Press (2000)) 
Exhibit E 

Joyner, Gene Targeting: A Practical Approach, preface, Oxford University Press 
(2000) 

Exhibit F 

Matise et al., Production of Targeted Embryonic Stem Cell Clones, p. 101 
in Joyner, Gene Targeting: A Practical Approach, Oxford University 
Press 2000) 

Exhibit G 

Crawley, What's Wrong With My Mouse Behavioral Phenotyping of Transgenic 
and Knockout Mice, p.l, rear cover, Wiley-Liss 2000) 

Exhibit H 

Doetschman, Laboratory Animal Science 49:137-143, 137 (1999) 
Exhibit I 

Researchers to Gain Wider Access to Knockout Mice 
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<<http://www.gen0me.g0v/l 70 15131 » 
Exhibit T 

Bradley et al., Biotechnology 10:534-538, 535-6 (1992) 
Exhibit K 

Deltagen and Merck Enter Into DeltaBase License Agreement 

<<http://www.sec.gov/Archives/edgar/data/1034072/000089843002000648/dex9 

92.htm» 

Exhibit L 

Declaration of Robert Driscoll, filed June 22, 2005 
Exhibit M 

DiMiasi et al., Journal of Health Economics 22 (2003) 151-185 
Exhibit N 

Huijsduijnen et al., Expert Opin. Ther. Targets, 2002, 6:624 
Exhibit O 

Hopkins & Groom, Nature Reviews Drug Discovery 1 :727 (2002) 
Exhibit P 

Arthur T. Sands, Current Drug Discovery, Aug. 2002, at 21 . 
Exhibit O 

Elchebly et al., Science 283:1544 (1999) 
Exhibit R 

American Association of Clinical Endocrinologists Medical Guidelines for 
Clinical Practice for the Diagnosis and Treatment of Hyperandrogenic Disorders, 
Endocrine Practice 7:121 (2001) 
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Exhibit S 

Azziz et al., J. Clin. Endocrinol. Metab. 2004, 89(2):453-62 
Exhibit T 

Cooper, The Cell: A Molecular Approach, Chapter 15 (2000) 
Exhibit U 

Wang et al, 2004, Science 304:1 164 
Exhibit V 

Wadham et al., Molecular Biology of the Cell 14:2520 (2003) 
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X, CLOSING REMARKS 

For the foregoing reasons, Appellant submits that the rejection of claims 
28-32, 37, 47 and 53-57 under 35 U.S.C. § 101 for lack of utility is improper; that 
the rejection of claims 28-32, 37, 47 and 53-57 under 35 U.S.C. § 112, first 
paragraph, for lack of enablement is improper; that the rejection of claims 28-32, 
37, 47, 53-54, and 57 under 35 U.S.C. § 112, first paragraph, for lack of written 
description is improper; and that the rejection of claim 32 for indefiniteness 
under 35 U.S.C. § 112, second paragraph, is improper. 

Please charge the Appeal Brief fee of $250 for a small entity to Deposit 
Account No. 502775. It is believed that no other fees are due with this Appeal 
Brief. If this is in error, please charge any additional fees to Deposit Account No. 



This constitutes a request for any needed extension of time under 37 
C.F.R. § 1.136(a) and an authorization to charge all fees therefore to deposit 
account No. 502775 if not otherwise specifically requested. 

Respectfully submitted, 



502775. 
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EXHIBIT A 



Background on Mouse as a Model Organism 
«http://www.genome.gov/pfv.cfm?pageid=10005834» 
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Background on Mouse as a Model Organism 

Over the past century, the mouse has developed into the premier mammalian model system 
for genetic research. Scientists from a wide range of biomedical fields have gravitated to the 
mouse because of its close genetic and physiological similarities to humans, as well as the 
ease with which its genome can be manipulated and analyzed. 

Although yeasts, worms and flies are excellent models for studying the cell cycle and many 
developmental processes, mice are far better tools for probing the immune, endocrine, 
nervous, cardiovascular, skeletal and other complex physiological systems that mammals 
share. Like humans and many other mammals, mice naturally develop diseases that affect 
these systems, including cancer, atherosclerosis, hypertension, diabetes, osteoporosis and 
glaucoma. In addition, certain diseases that afflict humans but normally do not strike mice, 
such as cystic fibrosis and Alzheimer's, can be induced by manipulating the mouse genome 
and environment. Adding to the mouse's appeal as a model for biomedical research is the 
animal's relatively low cost of maintenance and its ability to quickly multiply, reproducing as 
often as every nine weeks. 

Mouse models currently available for genetic research include thousands of unique inbred 
strains and genetically engineered mutants. There are mice prone to different cancers, 
diabetes, obesity, blindness, Lou Gehrig's disease, Huntington's disease, anxiety, aggressive 
behavior, alcoholism and even drug addiction. Immunodeficient mice can also be used as 
hosts to grow both normal and diseased human tissue, a boon for cancer and AIDS research. 

In the early days of biomedical research, scientists developed mouse models by selecting and 
breeding mice to produce offspring with the desired traits. Researchers also learned to 
produce useful, new models of genetic disease quickly and in large numbers by exposing 
mice to DNA-damaging chemicals, a process known as chemical mutagenesis. 

In recent decades, researchers have utilized an array of innovative genetic technologies to 
produce custom-made mouse models for a wide array of specific diseases, as well as to study 
the function of targeted genes. One of the most important advances has been the ability to 
create transgenic mice, in which a new gene is inserted into the animal's germline. Even 
more powerful approaches, dependent on homologous recombination, have permitted the 
development of tools to "knock out" genes, which involves replacing existing genes with 
altered versions; or to "knock in" genes, which involves altering a mouse gene in its natural 
location. To preserve these extremely valuable strains of mice and to assist in the 
propagation of strains with poor reproduction, researchers have taken advantage of state-of- 
the-art reproductive technologies, including cryopreservation of embryos, in vitro fertilization 
and ovary transplantation. 

The Jackson Laboratory, a publicly supported national repository for mouse models in Bar 
Harbor, Maine, has played a crucial role in the development of the mouse into the leading 
model for biomedical research. Established in 1929, the non-profit center pioneered the use 
of inbred laboratory mice to uncover the genetic basis of human development and disease. In 
fact, the famous "Black 6" or C57BL/6J mouse strain whose genome is the focus of the 
landmark sequencing effort was developed in the early 1920s by The Jackson Laboratory 
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founder Clarence Cook Little. 

Today, researchers at The Jackson Laboratory pursue projects in areas that include cancer, 
development and aging, immune system and blood disorders, neurological and sensory 
disorders, and metabolic diseases. Informatics researchers work with the public sequencing 
consortium to curate and integrate the sequenced mouse genome data with the wealth of 
biological knowledge collected in Jackson's Mouse Genome Informatics resource. 

In addition, The Jackson Laboratory distributes 2,700 different strains and stocks as breeding 
mice, frozen embryos or DNA samples. In FY 2002 alone, the lab supplied approximately 2 
million mice to the international scientific community. 

Listed below is a sampling of mouse models developed and/or distributed by The Jackson 
Laboratory, along with brief descriptions of the human diseases they are helping scientists to 
understand: 

• Down Syndrome - One of the most common genetic birth defects in humans, occurring 
once in every 800 to 1,000 live births, Down syndrome results from an extra copy of 
chromosome 21, an abnormality known as trisomy. The Ts65Dn mouse, developed at 
The Jackson Laboratory, mimics trisomy 21 and exhibits many of the behavioral, 
learning, and physiological defects associated with the syndrome in humans, including 
mental deficits, small size, obesity, hydrocephalus and thymic defects. This model 
represents the latest and best improvement of Down syndrome models to facilitate 
research into the human condition. 

• Cystic Fibrosis (CF) - The Cftr knockout mouse has helped advance research into cystic 
fibrosis, the most common fatal genetic disease in the United States today, occurring in 
approximately one of every 3,300 live births. Scientists now know that CF is caused by 
a small defect in the gene that manufactures CFTR, a protein that regulates the 
passage of salts and water in and out of cells. Studies with the Cftr knockout have 
shown that the disease results from a failure to clear certain bacteria from the lung, 
which leads to mucus retention and subsequent lung disease. These mice have become 
models for developing new approaches to correct the CF defect and cure the disease. 

• Cancer - The p53 knockout mouse has a disabled Trp53 tumor suppressor gene that 
makes it highly susceptible to various cancers, including lymphomas and 
osteosarcomas. The mouse has emerged as an important model for human Li-Fraumeni 
syndrome, a form of familial breast cancer. 

• Glaucoma - The DBA/2 J mouse exhibits many of the symptoms that are often 
associated with human glaucoma, including elevated intraocular pressure. Glaucoma is 
a debilitating eye disease that is the second leading cause of blindness in the United 
States. 

• Type 1 Diabetes - This autoimmune disease, also known as Juvenile Diabetes, or 
Insulin Dependent Diabetes Mellitus (IDDM), accounts for up to 10 percent of diabetes 
cases. Non-obese Diabetic (NOD) mice are enabling researchers to identify IDDM 
susceptibility genes and disease mechanisms. 

• Type 2 Diabetes - A metabolic disorder also called Non-Insulin Dependent Diabetes 
Mellitus (NIDDM), this is the most common form of diabetes and occurs primarily after 
age 40. The leading mouse models for NIDDM and obesity research were all developed 
at The Jackson Laboratory: Cpe fat , Lep ob , Lepr* b and tub. 

• Epilepsy - The "slow-wave epilepsy," or swe, mouse is the only model to exhibit both of 
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the two major forms of epilepsy: petit mal (absence) and grand mal (convulsive). It 
shows particular promise for research into absence seizures, which occur most often in 
children. 

• Heart Disease - Elevated blood cholesterol levels and plaque buildup in arteries within 
three months of birth (even on a low-fat diet) are characteristics of several 
experimental models for human atherosclerosis: the Apoe knockout mouse and 
C57BU/6J. 

• Muscular Dystrophy - The Dmd mdx mouse is a model for Duchenne Muscular 
Dystrophy, a rare neuromuscular disorder in young males that is inherited as an X- 
linked recessive trait and results in progressive muscle degeneration. 

• Ovarian Tumors - The SWR and SWXJ mouse models provide excellent research 
platforms for studying the genetic basis of ovarian granulosa cell tumors, a common 
and very serious form of malignant ovarian tumor in young girls and post-menopausal 
women. 

Contact: 

Geoff Spencer 
NHGRI 

Phone: (301) 402-0911 
© Top of page 
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EXHIBIT B 



Austin et al., Nature Genetics (2004) 36(9):921-24, 921 



COMMENTARY 



The Knockout Mouse Project 

Mouse knockout technology provides a powerful means of elucidating gene function in vivo, and a publicly available 
genome-wide collection of mouse knockouts would be significantly enabling for biomedical discovery. To date, published 
knockouts exist for only about 10% of mouse genes. Furthermore, many of these are limited in utility because they have 
not been made or phenotyped in standardized ways, and many are not freely available to researchers. It is time to harness 
new technologies and efficiencies of production to mount a high-throughput international effort to produce and 
phenotype knockouts for all mouse genes, and place these resources into the public domain. 



Now that the human and mouse genome 
sequences are known 1 " 3 , attention has turned 
to elucidating gene function and identifying 
gene products that might have therapeutic 
value. The laboratory mouse (Mus musculus) 
has had a prominent role in the study of 
human disease mechanisms throughout the 
rich, 100-year history of classical mouse genet- 
ics, exemplified by the lessons learned from 
naturally occurring mutants such as agouti 4 , 
reeler 5 and obese 6 . The large-scale production 
and analysis of induced genetic mutations in 
worms, flies, zebrafish and mice have greatly 
accelerated the understanding of gene function 
in these organisms. Among the model organ- 
isms, the mouse offers particular advantages 
for the study of human biology and disease: (i) 
the mouse is a mammal, and its development, 
body plan, physiology, behavior and diseases 
have much in common with those of humans; 
(ii) almost all (99%) mouse genes have 
homologs in humans; and (iii) the mouse 
genome supports targeted mutagenesis in spe- 
cific genes by homologous recombination in 
embryonic stem (ES) cells, allowing genes to be 
altered efficiently and precisely. 

The ability to disrupt, or knock out, a spe- 
cific gene in ES cells and mice was developed 
in the late 1980s (ref. 7), and the use of knock- 
out mice has led to many insights into human 
biology and disease 8 " 11 . Current technology 
also permits insertion of 'reporter' genes into 
the knocked-out gene, which can then be 
used to determine the temporal and spatial 



The Comprehensive Knockout Mouse 
Project Consortium* 

* Authors and their affiliations are listed at the 
end of the paper. 



expression pattern of the knocked-out gene in 
mouse tissues. Such marking of cells by a 
reporter gene facilitates the identification of 
new cell types according to their gene expres- 
sion patterns and allows further characteriza- 
tion of marked tissues and single cells. 

Appreciation of the power of mouse genet- 
ics to inform the study of mammalian physi- 
ology and disease, coupled with the advent of 
the mouse genome sequence and the ease of 
producing mutated alleles, has catalyzed pub- 
lic and private sector initiatives to produce 
mouse mutants on a large scale, with the goal 
of eventually knocking out a substantial por- 
tion of the mouse genome 12,13 . Large-scale, 
publicly funded gene-trap programs have 
been initiated in several countries, with the 
International Gene Trap Consortium coordi- 
nating certain efforts and resources 14 " 17 . 

Despite these efforts, the total number of 
knockout mice described in the literature is 
relatively modest, corresponding to only - 10% 
of the -25,000 mouse genes. The curated 
Mouse Knockout & Mutation Database lists 
2,669 unique genes (C. Rathbone, personal 
communication), the curated Mouse Genome 
Database lists 2,847 unique genes, and an 
analysis at Lexicon Genetics identified 2,492 
unique genes (B.Z., unpublished data). Most 
of these knockouts are not readily available to 
scientists who may want to use them in their 
research; for example, only 415 unique genes 
are represented as targeted mutations in the 
Jackson Laboratory's Induced Mutant 
Resource database (S. Rockwood, personal 
communication). 

The converging interests of multiple mem- 
bers of the genomics community led to a meet- 
ing to discuss the advisability and feasibility of 



a dedicated project to produce knockout alleles 
for all mouse genes and place them into the 
public domain. The meeting took place from 
30 September to 1 October 2003 at the 
Banbury Conference Center at Cold Spring 
Harbor Laboratory. The attendees of the meet- 
ing are the authors of this paper. 

Is a systematic project warranted? 

A coordinated project to systematically knock 
out all mouse genes is likely to be of enormous 
benefit to the research community, given the 
demonstrated power of knockout mice to eluci- 
date gene function, the frequency of unpre- 
dicted phenotypes in knockout mice, the 
potential economies of scale in an organized 
and carefully planned project, and the high cost 
and lack of availability of knockout mice being 
made in current efforts. Moreover, implement- 
ing such a systematic and comprehensive plan 
will greatly accelerate the translation of genome 
sequences into biological insights. Knockout ES 
cells and mice currently available from the pub- 
lic and private sectors should be incorporated 
into the genome-wide initiative as much as 
possible, although some may be need to be pro- 
duced again if they were made with suboptimal 
methods (&£., not including a marker) or if 
their use is restricted by intellectual property or 
other constraints. The advantages of such a sys- 
tematic and coordinated effort include efficient 
production with reduced costs; uniform use of 
knockout methods, allowing for more compa- 
rability between knockout mice; and ready 
access to mice, their derivatives and data to all 
researchers without encumbrance. Solutions to 
the logistical, organizational and informatics 
issues associated with producing, characteriz- 
ing and distributing such a large number of 
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Figure 1 Structure of resource production in the proposed KOMP. Using the mouse genome sequence 
as a foundation, knockout alleles in ES cells will be produced for all genes. A subset of ES cell 
knockouts will be used each year to produce knockout mice, determine the expression pattern of the 
targeted gene in a variety of tissues and carry out screening-level (Tier 1) phenotyping. In a subset of 
mouse lines, transcriptome analysis and more detailed system-specific (Tier 2) phenotyping will be 
done. Finally, specialized phenotyping will be done on a smaller number of mouse lines with 
particularly interesting phenotypes. AM stages will occur within the purview of the KOMP except for the 
specialized phenotyping, which will occur in individual laboratories with particular expertise. 



mice will draw from the experience of related 
projects in the private sector and in academia, 
which have made or phenotyped hundreds of 
knockout mice using a variety of techniques. 
Lessons learned from these projects include the 
need for redundancy at each step to mitigate 
pipeline bottlenecks and the need for robust 
informatics systems to track the production, 
analysis, maintenance and distribution of thou- 
sands of targeting constructs, ES cells and mice. 

Null-reporter alleles should be created 

The project should generate alleles that are 
as uniform as possible, to allow efficient pro- 
duction and comparison of mouse pheno- 
types. The alleles should achieve a balance of 
utility, flexibility, throughput and cost, A 
null allele is an indispensable starting point 
for studying the function of every gene. 
Inserting a reporter gene (e.g., (3-galactosi- 
dase or green fluorescent protein) allows a 
rapid assessment of which cell types nor- 
mally support the expression of that gene. 
Therefore, we propose to produce a null- 
reporter allele for each gene. Making each 
mutation conditional in nature by adding 
ris-elements (e.g., lox? or FRT sites) would 



be desirable, but we do not advocate this as 
part of the mutagenesis strategy unless the 
technological limitations currently associ- 
ated with generating conditional targeted 
mutations on a large scale and in a cost- 
effective manner can be overcome. 

A combination of methods should be used 

Various methods can be used to create 
mutated alleles, including gene targeting, 
gene trapping and RNA interference. 
Advantages of conventional gene targeting 
include flexibility in design of alleles, lack of 
limitation to integration hot spots, reliability 
for producing complete loss-of-function alle- 
les, ability to produce reporter knock-ins and 
conditional alleles, and ability to target splice 
variants and alternative promoters. BAC- 
based targeting has the potential advantages 
of higher recombination efficiencies and flex- 
ibility for producing complex mutated alle- 
les 18 . Gene trapping is rapid, is cost-effective 
and produces a large variety of insertional 
mutations throughout the genome but can be 
somewhat less flexible 17,19 " 21 . There is uncer- 
tainty regarding the percentage of gene traps 
that produce a true null allele and the fraction 



of the genome that can ultimately be covered 
by gene-trap mutations. Trapping is not 
entirely random but shows preference for 
larger transcription units and genes more 
highly expressed in ES cells. In recent studies, 
gene trapping was estimated to potentially 
produce null alleles for 50-60% of all genes, 
perhaps more if a variety of gene-trap vectors 
with different insertion characteristics is 
used 17,21 . RNA interference offers enormous 
promise for analysis of gene function in 
mice 22 but is not yet sufficiently developed for 
large-scale production of gene modifications 
capable of reliably producing true null alleles. 
Both gene-targeting and gene-trapping meth- 
ods are suitable for producing large numbers 
of knockout alleles, and, given their comple- 
mentary advantages, a combination of these 
methods should be used to produce the 
genome- wide collection of null- reporter alle- 
les most efficiently. 

What should the deliverables be? 

A genome-wide knockout mouse project 
could deliver to the research community a 
trove of valuable reagents and data, including 
targeting and trapping constructs and vec- 
tors, mutant ES cell lines, live mice, frozen 
sperm, frozen embryos, phenotypic data at a 
variety of levels and detail, and a database 
with data visualization and mining tools. At a 
minimum, we believe that a comprehensive 
genome-wide resource of mutant ES cell lines 
from an inbred strain, each with a different 
gene knocked out, should be produced and 
made available to the community. Choosing 
an inbred line (129/SvEvTac or C57BL/6J), 
and evaluating the alternative of using F t ES 
cells and tetraploid aggregation to provide 
potential time savings, merits additional sci- 
entific review and discussion 23,24 . ES cells 
should be converted into mice at a rate con- 
sistent with project funding and the ability of 
the worldwide scientific community to ana- 
lyze them. Although the value and cost-effec- 
tiveness of systematically characterizing the 
mice is a matter of debate, a limited set of 
broad and cost-effective screens, probably 
including assessment of developmental 
lethality, physical examination, basic blood 
tests, and histochemical analysis of reporter 
gene expression, would be useful. More 
detailed phenotyping, based on findings 
from the initial screen or existing knowledge 
of the gene's function, could be done at spe- 
cialized centers. All ES cell clones and mice 
(as frozen embryos or sperm) should be 
available to any researcher at minimal cost, 
and all mouse phenotyping and reporter 
expression data should be deposited into a 
public database. 
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In determining how to implement the pro- 
ject, utility to the research community should 
be the standard for judging value. Each step 
after ES cell generation {e.g., mouse creation, 
breeding, expression analysis, phenotyping) 
will make the resource useful to more 
researchers but will also increase costs and sci- 
entific complexity. We therefore advocate a 
'pyramid' structure for the project (Fig. 1). At 
the base of the pyramid is the genome-wide 
collection of mutant ES cells for every mouse 
gene. Over time, a subset of these mutant ES 
cells should be made into mice and character- 
ized with an initial phenotype screen (Tier 1; 
Fig. 1) and analysis of tissue reporter-gene 
expression. A subset of these lines should be 
profiled by microarray analysis, and a subset of 
these profiled by system-specific (Tier 2) phe- 
notyping, based on the results of the Tier 1 
phenotyping, array studies, existing knowl- 
edge of the gene's function and the gene's tissue 
expression pattern. With time, the upper tiers 
of the pyramid will be filled out, eventually 
transforming the pyramid into a cube, with 
information of all types available for all genes. 

This project will require the resolution of 
numerous intellectual property claims involv- 
ing the production and use of knockout mice. 
To deal with the existing patents that cover 
the technologies and processes involved in the 
production of mutant mice, we suggest that a 
'patent pool', such as that used in the semi- 
conductor industry 25 , should be generated. 
Several individuals who represent entities that 
control patents on mouse knockout technolo- 
gies are authors on this paper, and they agree 
with this approach. We also agree that any 
mutant ES cells or mice produced should be 
placed immediately in the public domain. 

Mechanisms and costs 

ES cell production. Automated knockout 
construct and ES ceU production should be 
carried out in coordinated centers to ensure 
efficiency and uniformity. We estimate that 
most known mouse genes could be knocked 
out in ES cells within 5 years, using a combina- 
tion of gene-trapping and gene-targeting tech- 
niques. Gene trapping can produce a large 
number of mutated alleles quickly, but its 
progress should be monitored closely to deter- 
mine when its yield of new genes diminishes 17 
and, therefore, when targeting should be 
increasingly relied on. As large-scale trapping 
projects have already defined gene classes that 
probably cannot be knocked out by trapping 
{e.g., single-exon GPCRs, genes that are not 
expressed in ES cells), we propose that target- 
ing begin on those classes immediately. All ES 
cells should be made available to the research 
community, because this collection itself 



would be a valuable resource. Efforts in the 
public and private sectors have already 
knocked out many genes in ES cells, and, to the 
degree that the alleles produced fit the pre- 
scribed characteristics (i.e., null alleles with a 
reporter) and are available, every effort should 
be made to incorporate these into the planned 
public resource. Costs for generating this part 
of the resource were estimated at between 
$9-1 1 million/year for five years (these and all 
subsequent figures are direct costs). 

Mouse production. The subset of ES cells 
made into mice each year should be chosen by 
a peer-review process. Central facilities for 
high-efficiency mouse production, genotyp- 
ing, breeding, maintenance and archiving 
should be funded, to take advantage of effi- 
ciencies of scale in mouse creation and distri- 
bution. Researchers could apply to produce 
groups of mice outside the centers, as long as 
they meet the cost specifications of the pro- 
gram. All mice should be made available 
immediately to researchers as frozen embryos 
or sperm, for nominal distribution cost. An 
initial target of 500 new mouse lines per year 
would double the current rate at which new 
genes are knocked out in the public sector; we 
feel that this rate is within the capacity of the 
biomedical research community worldwide 
to absorb and analyze. We estimated the ini- 
tial cost of this level of mouse production to 
be $12.5-15 million per year. 

Reporter tissue expression analysis. 
Approximately 30 tissues from adult and 
developmental stages should be sampled to 
cover the main organ systems. Analysis meth- 
ods should be customized to the organ system 
and marker, and a searchable database of the 
sites of gene expression, and the images show- 
ing them, should be produced. Centers to 
carry out these analyses and data curation 
should be selected by peer review. We esti- 
mated the cost of this component for 500 
mouse lines to be S2.5-5 million per year, 
depending on how much tissue sectioning and 
cell -level analysis is done. 

Phenotyping. Tier 1 phenotyping should 
be a low-cost screen for clear phenotypes and 
should be done on all mouse lines produced. 
Tier 1 should include home-cage observation, 
physical examination, blood hematological 
and chemistry profiles, and skeletal radi- 
ographs. The centers producing the mice 
should carry out the Tier 1 analyses, at an esti- 
mated cost of $2.5 million per year for 500 
lines. Selected lines, chosen on the basis of 
findings from Tier 1 phenotyping, tissue 
expression patterns, microarray data and the 
scientific literature, should undergo more 
detailed and system -focused Tier 2 phenotyp- 
ing. Tier 2 phenotyping should be done in 



specialized phenotyping centers, akin to those 
already in operation for phenotyping of mice 
produced by ENU mutagenesis. All Tier 1 and 
Tier 2 phenotyping should be done on a uni- 
form genetic background by dedicated groups 
of individuals in single locations, to facilitate 
consistency and cross-comparison of results 
among different mouse lines. All Tier 1 and 
Tier 2 phenotyping results should be 
deposited into a central project database freely 
accessible to the research community. More 
detailed and specialized phenotyping could be 
done by individual researchers in their own 
laboratories; deposition of this more detailed 
phenotype data would be encouraged. 

Transcriptome analysis. Transcriptome 
profiling of tissues from each knockout line, 
collected in a uniform way across all mice and 
tissues and placed into a searchable relational 
database, would add substantially to the sci- 
entific value of the project, though it would 
also add considerably to its cost. 
Transcriptome analysis should therefore be 
done on a subset of mice, chosen by peer 
review. We estimate that, with the best cur- 
rently available array technology, an analysis 
of ten tissues would cost ~$ 1 8,000 per line. 

Conclusions 

This project, tentatively named the Knockout 
Mouse Project (KOMP), will be a crucial step 
in harnessing the power of the genome to 
drive biomedical discovery. By creating a 
publicly available resource of knockout mice 
and phenotypic data, KOMP will knock 
down barriers for biologists to use mouse 
genetics in their research. The scientific con- 
sensus that we achieved — that a dedicated 
project should be undertaken to produce 
mutant mice for all genes and place them 
into the public domain — is important but is 
only the beginning. Implementation of these 
recommendations will require additional 
input from the greater scientific community, 
including those responsible for program- 
matic direction and financial support of bio- 
medical research in the public and private 
sectors. This ambitious and historic initiative 
must be carried out as a collaborative effort 
of the worldwide scientific community, so 
that all can contribute their skills, and all can 
benefit. International discussions among sci- 
entific and programmatic staffs since the 
Banbury meeting at Cold Spring Harbor, in 
both the public and private sectors, have 
shown that there is great enthusiasm and 
commitment to this vision. The next step for 
KOMP will be to move this visionary plan 
from conceptualization to implementation, 
with an urgency befitting the benefits it will 
bring to science and medicine. 
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URLs. The curatcd Mouse Knockout & Mutation 
Database is available at http://research.bmn.com/ 
mkmd/. The curated Mouse Genome Database is 
available at http://www.informatics.jax.org/. Patent 
pools: A solution to the problem of access in biotech- 
nology patents? is available at http://www.uspto. 
gov/web/offices/pac/dapp/opla/patentpool.pdf. 
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the target gene in a particular place or at a particular time. The most common of 
these recombination systems called Cre/lox, is widely used to engineer gene 
replacements in mice and in plants (see Figure 5-82). In this case the target gene 
in ES ceils is replaced by a fully functional version of the gene that is Banked by 
a pair of the short DNA sequences, called lox sites, that are recognized by the 
Cre recorabinase protein. The transgenic mice that result are phenotypicaliy 
normal. They are then mated with transgenic mice that express the Cre recom- 
binase gene under the control of an inducible promoter. In the specific cells or 
tissues in which Cre is switched on, it catalyzes recombination between the lox 
sequences— excising a target gene and eliminating its activity. Similar recombi- 
nation systems are used to generate conditional mutants in Drosophila (see 
Figure 21-48). 
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ES cells growing 
In tissue culture 



(B) female mouse 
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Figure 8-70 Summary of the 
procedures used for making gene 
replacements in mice. In the first step' 
(A), an altered version of the gene b 
introduced Into cultured ES (embryonic ''■ 
stem) cells. Only a few rare ES cells will 
have their corresponding normal genes 
replaced by the altered gene through a 
homologous recombination event 
Although the procedure is often laborious 
these rare cells can be identified and 
cultured to produce many descendants, 
each of which carries an altered gene in 
place of one of its two normal 
corresponding genes. In the next step of 
the procedure (B), these altered ES cefe 
are injected into a very earty mouse 
embryo: the cell* are incorporated into 
the growing embryo, and a mouse 
produced by such an embryo will contain 
some somatic cells (indicated by orange) 
that carry the altered gene. Some of these 
mice will also contain germ-line cells that 
contain the altered gene. When bred wid» 
a normal mouse, some of the progeny of 
these mice will contain the altered gene h 
all of their cells. If two such mice arc in 
turn bred (not shown), some of the 
progeny will contain two altered genes 
(one on each chromosome) in all of their 
cells. 

If die original gene alteration complete? 
inactivates the function of the gene, these 
mice are known as knockout mice. When 
such mice are missing genes that function 
during development, they often die with 
specific defects long before they reach 
adulthood. These defects are carefulh; 
analyzed to help decipher the normal 
function of die missing gene. 
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f frjnsgenic Plants Are Important for Both Cell 
0dg/ and Agriculture . 

jyjjgn a plant is damaged, it can often repair itself by a process in which mature 
jjjfleientiated cells, "dedifferentiate," proliferate, and then redifferentiate into 
ol j 1 ercell types. In some circumstances the dedifferentiated cells can even form 
gjj apical raeristera which can then give rise to an entire new plant, including 
^metes. This remarkable plasticity of plant cells can be exploited to generate 
JjaflSgenic plants from cells growing in culture. 

\Vhen a piece of plant tissue is cultured in a sterile medium containing 
nutrients and appropriate growth regulators, jnany of the cells are stimulated to 
proliferate indefinitely in a disorganized manner, producing a mass of relatively 
undifferentiated cells called a callus. If the nutrients and growth regulators are 
gallfly manipulated, one can induce the formation of a shoot and then root 
apical meristems within the callus, and, in many species, a whole new plant can 
be regenerated. 

Callus cultures can also be mechanically dissociated into single cells, which 
wll grow and divide as a suspension culture. In several plants — including 
tobacco, petunia, carrot, potato, and Arabidopsis — a single cell from such a sus- 
pension culture can be grown into a small clump (a clone) from which a whole 
plant can be regenerated, Such a cell, which has the ability to give rise to all parts 
of the organism, is considered totipotent Just as mutant mice can be derived by 
genetic manipulation of embryonic stem cells in culture, so transgenic plants 
can be created from single totipotent plant cells transferred with DNA in culture 
(Figure 8-72). 

The ability to produce transgenic plants has greatly accelerated progress in 
many areas of plant cell biology. It has had an important role, for example, in iso- 
lating receptors for growth regulators and in analyzing the mechanisms of mor- 
phogenesis and of gene expression in plants. It has also opened up many new 
possibilities in agriculture that could benefit both the farmer and the consumer. 
It has made it possible, for example, to modify the lipid, starch, and protein stor- 
age reserved in seeds, to impart pest and virus resistance to plants, and to create 
modified plants that tolerate extreme habitats such as salt marshes or water- 
stressed soil. 

Many of the major advances in understanding animal development have 
come from studies on the fruit fly Drosophila and the nematode worm 
Qmorhabditis elegans, which are amenable to extensive genetic analysis as well 
as to experimental manipulation. Progress in plant developmental biology has, 
in the past, been relatively slow by comparison. Many of the plants that have 
proved most amenable to genetic analysis— such as maize and tomato— have 
long life cycles and very large genomes, making both classical and molecular 
genetic analysis time-consuming. Increasing attention is consequendy being 
paid to a fast-growing small weed, the common wall cress (Arabidopsis 
ihalUma). which has several major advantages as a "model plant' (see Figures 
M6 and 21-107). The relatively small Arabidopsis genome was the first plant 
genome to be completely sequenced. 

Large Collections of Tagged Knockouts Provide a Tool for 
Hamming the Function of Every Gene in an Organism 

Extensive collaborative efforts are underway to generate comprehensive libraries 
of mutations in several model organisms, including 5. cerevisiae, C. elegaris, 
brosophila* Arabidopsis, and the mouse. The ultimate aim in each case is to pro- 
duce a collection of mutant strains in which every gene in the organism has 
After been systematically deleted, or altered such that it can be conditionally 
T^pted. Collections of this type will provide an invaluable tool for investigat- 
es gene function on a genomic scale. In some cases, each of the individual 
Mutants within the collection will sport a distinct molecular tag— a unique DNA 
^ence designed to make identification of the altered gene rapid and routine. 
1 In & cerevisiae, the task of generating a set of 6000 mutants, each missing 




Figure 8-7 1 Mouse with an 
engineered defect in fibroblast 
growth factor 5 (R3F5). FGF5 h a 
negative regulator of hair formation. In a 
mouse lacking FGF5 (rigfit), the hair b long 
compared with its heterozygous Ihxermate 
(7eft). Transgenic mice with phenotypes 
that mimic aspects of a variety of human 
disorders, including Alzheimer's disease, 
adierosderosb, diabetes, cystic fibrosis, 
and some type of cancers, have been 
generated. Their study may lead to the 
development of more effective treatments. 
(Courtesy of Gail Martin, from J.M. Hebert 
et at, OB 78:1017-1025. 1994.© Elsevier.) 
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Genes VII, p.508 (Lewin, Oxford University Press (2000)) 
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which exogenous DNA is introduced from a bacterium 
into a host cell. The mechanism resembles that of bac- 
terial conjugation. Expression of the bacterial DNA in 
its new host changes the phenotype of the cell. In the 
example of the bacterium Agtvbacterium tumejacitn^ 
the result LS to induce tumor formation by an infected 
plant cell. 

Alterations in the relative proportions of compo- 
nents of the genome during somatic development 
occur to allow insect larvae to increase the number of 
copies of certain genes. And the occasional amplifica- 
tion of genes in cultured mammalian cells is indicated 
by our abfflty to select variant cells with on increased 
copy number of some genes. Initiated within the 
genome, the amplification event can create additional 
copies of a gene that survive in either intrachromoso- 
rnal or cxtrachromosomal form. 

When extraneous DNA is introduced into ewfcary- 
otic cells, it may give rise to cxtrachrornosomal forms 
or may be integrated into the genome. The relationship 
between the extrachromosomal and genomic forms is 
irregular, depending on chance and to some degree un* 
predictable events, rather than resembling the regular 
interchange between free and integrated forms of bac- 
terial plasmids. 



Vet, however accomplished, the process may lead to 
stable change in the genome; following its injection 
into animal eggs. DNA may even be incorporated into 
the genome and inherited thereafter as a normal com* 
ponent> sometimes continuing to function. Injected 
DNA may enter the germlinc as well as the soma, ere«t 
ing a transgenic animal. The ability to introduce spe- 
cific genes that function in an appropriate rnamiei 
could become a major medical technique for curin* 
genetic diseases. 6 

The converse of the introduction of new genes is 
the ability to disrupt specific endogenous genes 
Additional DNA can be introduced within a gene to 
prevent its expression and to generate a null allele 
Breeding from an animal with a null allele can generate 
a homozygous "knockout* which has no active copy of 
the gene. This is a powerful method to investigated^ 
reedy the importance and function of a gene. 

Considerable manipulation of DNA sequences 
therefore is achieved both in authentic situations and 
by experimental fiat. We are only just beginning to 
work out the mechanisms that permit the cell to re- 
spond to selective pressure by changing its bank of se- 
quences or that allow it to accommodate the intrusion 
of additional sequences. 



The mating pathway is triggered by signal 
transduction 



*T*He, yeast & cere wiac can propagate happily in ei- 
X thcr the haploid or diploid condition. Conversion 
between these states takes place by mating (fusion of 
haploid spores to give a diploid) and by spoliation 
( meiosis of diploids to give haploid spores). The a bility 
to engage in these activities is determined by the mat- 
ing type of the strain. 

The properties of the two mating types arc sum- 
marised in Figure 17.1. We may view them as rest- 
ing on the Ideological proposition that there is no 
point in mating unless the haptoids are of different 
genetic types; and sporulation is productive only 
when the diploid is heterozygous and thus can ecu- 
eratc recombinants. 

The mating type of a (haploid) cell is determined by 
the genetic information present at the AMTlocus. Ceils 
that carry the MATa allele at this locus are type a: like- 



wise, cells that carry the MATa allele are type fx Cells 
of opposite type can mate; cells of the same type 
cannot. 

Recognition of cells of opposite mating type is 
Figure 17-1 Mating type controls several activities. 
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Preface 



Over the pasi ten years it has become possible to make essentially any muta- 
tion in the gennlinc of mice by utilizing recombination and embryonic stem 
(ES) cells. Homologous recombination when applied to altering specific 
endogenous genes, referred to as gene targeting, provides the highest level of 
control over producing mutations in cloned genes. When, this is combined with 
site specific Tecombinauon, a wide range of mutations can be produced. ES cell 
lines are remarkable since after being established from a blastocyst, they can 
be cultured and manipulated relatively easily in vitro and still maintain their 
ability to step back into a normal developmental program when returned to a 
prc-implantation embryo. With the exponential increase in the number of 
genes identified by various genome projects and genetic screens, it has become 
imperative thai efficient methods be developed for determining gene function. 
Gene targeting in ES cells of fers a powerful approach to study gene function in 
a mammalian organism. Gene trap approaches In cclK in particular when 
Uicy are combined with sophisticated prcscreens, offer not only a route to gene 
discovery, but also to gain information on gene sequence, expression and 
mutant pbenotype. 

The bask technology necessary for making designer mutations ui mice has 
become widespread and researchers who have traditionally used cell biology 
or molecular experiment* arc adding gene targeting techniques to their reper- 
toire of experimtal approaches. A second edition of this book was written for 
two main reasons. The first was to update previously described techniques and 
to add new techniques that have greatly expanded the types of mutations that 
can be made using recombination in ES cells. A chapter in this new edition 
describes the design and use of site specific recombination for gene targeting 
approaches and production of conditional mutations, lire second reason foi 
the new book was to provide a more in depth discussion of the experimental 
design considerations thai are critical to a successful gene targeting SWdy and 
to add approaches for analyzing mutant phenolypcss the most interesting 
part of an experiment. Gene targeting experiments should be designed to 
go far beyond just making a mutant mouse. The success of a gene targeting 
experiment no longer lies in the making of the mutation, but depends on the 
imaginative and insightful analysts of the mutant phenotypes that the mutauon 
provides. A chapter in this edition describes the use of classical genetics in 
combination with gene targeting to get the most out of ja genetic approach to a 
biological question. 

The nature of in vivo gene targeting studies of gene function are such that 
critical design decisions must be made at every step ia the experiment, and 
each decision can have a major impact on the value of the information 
obtained. From the start, the type of mutauon to be made must be considered 



Preface 



carefully. Whereas 10 years ago most mutations were designed to create null 
mutations and were therefore relatively simple to design, at present, a null 
mutation is only one of a long list of mutations that can be made, each provid- 
ing different insight into the function of a gene. Point mutations, large dele- 
tions, gene exchanges (knock-ins) and conditional mutations are but a few of 
the choices one faces at the start of a gene targeting experiment. The next 
choice is the source of DN A for the targeting experiment and ES cell line to he 
used for the manipulations. Once the mutant BS cell clone has been obtained, 
there are then a number of alternative approaches that can be used to make 
ES cell chimeras that depend on the ES cell line which was used Finally, and 
most Importantly, is the analysis of any phcaotypc that arises. This second 
addition discusses techniques used to analyze mutant mke, ranging from Man- 
dard descriptive evaluation, to a chimera analysis or complicated breeding 
experiments that utilize double mutants, Jf mice arc simply considered as a 
4 bag of cells* or an ui vivo source of selected ceU types, then the tremendous 
resource which mice offer as a model organism is not being realized. Tnc life of 
a mouse represents a continuum of dynamic processes, including pattern for- 
mation, organ development teaming, homeostasis and disease. By making 
genetic alterations in mice using gene targeting and ES cells, the effects of a 
given change can be studied in the context of the whole organism. 

My goal in editing this book was to provide a manual that could take a new- 
comer to the exciting field of gene targeting and mutant analysis in mice from a 
doned gene to a basic understanding of the genetic approaches available using 
ES cells, and how each technique can be used 10 design a particular in vivo tesi 
of gene function. The book should also provide a valuable bench side resource 
for anyone canning out gene targeting or gone trap experiments, a chimera 
analysis or classical genetic approaches. I would once again like to extend 
many thanks and my deepest appreciation to all the authors for their greal 
efforts in Including detailed protocols and lucid discussions of the various 
approaches presented. I would also like to thank my family for their strong 
support and laboratory members past and present for helping to make gene 
targeting a reality. Finally, since many of the techniques use mke, the experi- 
ments should be carried out in accordance with local regulations. 

New York, NY A.LJ. 
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Gene targeting, principles, and 
practice in mammalian cells 

PAUL HASTY. ALEJANDRO ABUIN, awl A.ULAN BRADLEY 



1. Introduction 

When a fragment of genomic DNA is introduced Into a mammalian cell it can 
locate and rccombinc with the endogenous homologous sequences. This type 
of homologous recombination, known as gene Uingeting, is the subject of this 
chapter. Gene targeting has been widely used, particularly in mouse em- 
bryonic stem (ES) eclb, to make a variety of mutations in many different loci 
so that the phenotypte consequences of specific genetic modifications can be 
assessed in the organism. 

The first experimental evidence for the occurrence of gene targeting in 
mammalian cells was made using a fibroblast cell line with a selectable 
artificial locus by Lin et ai (I), and was subsequently demonstrated to occur at 
the endogenous p-globin gene by Smithies tt ai in erylhroleukacmia cells (2), 
In general, the frequencies of gene targeting in mammalian cells are relatively 
low compared to yeast cells and this is probably eclated to, at least in part, a 
competing pathway: efficient integration of the transfected DNA into a ran- 
dom dtrumosoraal ate, The relative ratio of targeted to random integration 
events will determine the case with which targeted clones arc identified in a 
gene urges ing experiment, litis chapter details sispects of vector design which 
can determine the efficiency of recombination, tbc type of mutation that may 
be generated in the target locus, as well as the selection and screening 
strategies which can be used lo identify clones of ES cells with the desired 
targeted modification. Since the most common experimental strategy is to 
ablate the function of a target gene (null allele) by introducing a selectable 
marker gene, wc initially describe the vectors and the selection schemes which 
arc helpful in the identification of recombinant clones (Sections 2-5). In 
Section 6. wc describe the vectors and additional considerations for gener- 
ating subtle mutations in a target locus devoid of any exogenous sequences. 
Franiry, Section 7 is dedicated to the use of geme targeting as a method to 
express exogenous genes from specific endogenous regulatory elements in 
vivo, also known as 'knock-in' strategies. 



j i Gene targeting, principhs* and practice in mammalian cells 
enrich populations of transfectcd cells for targeted integration events (Section 
4.2.1). 

2,1 Design considerations of a replacement vector 

the principal consideration in the design of a replacement vector, is the type 
of mutation generated. Secondary (yet stiU important) considerations relate 
to the selection scheme and screening techniques required to isolate the re- 
combinant clones. The recombinant alleles generated by replacement vectors 
typically have a selection cassette inserted into a coding exon or replacing part 
of the locus. It is important to consider that, exon Interruptions and small 
deletions will not necessarily ablate the function of the target gene to generate 
a null allele. Consequently, it is necessary to confirm that the allele which has 
been generated in null by RNA and/or protein analysis and in many cases 
transcripts and truncated proteins from such a mutant allele can be detected. 
Considering that products from the mutated tocus may have some function 
(normal or abnormal) it is important to design a replacement vector so that 
the targeted allele is null, particularly in the absence of a good assay for the 
gene product. Disruption or deletion of the coding sequence by the positive 
selection marker will in most instances ablate a gene's function. However in 
some situations a truncated protein may be generated! which retains some 
biological activity, thus some knowledge of mutations in a related gene in 
another organism can be helpful in the determination of the possible function 
of a targeted allele. Null alleles are more likely to occur by deleting or 
rccombinuig a selection cassette into more 5' exons rather than exons that 
encode the C-tcrminus of the protein, since under these circumstances 
minimal portions of the wild-type polypeptide would be made. 
I There arc several eortsidc rations to take into account when a positive 
\ selection marker is to be inserted into an exon. One critical consideration is 
\ that since the length of an exon art influence RNA splicing (3), an artificially 
| large exon caused by the insertion of a selectable marker may not be 
I recognized by the splicing machinery and could be skipped. Thus, transcripts 
£ initiated from the endogenous promoter may delete the mutated exon from 
1 the mRNA specks or even additional exons. If a skipped exon is a coding 
I exon whose nucleotide length is not a multiple of three (codon) the net result 
j will be both a deletion and a frame-shift mutation of the gene, which will 
J often generate a null allele. However, if the disrupted coding exon has a 
j nucleotide length which is a multiple of three, if spliced out, this would result 
§ in a protein with a small in-frame deletion which may retain partial or con> 
5 plirte function- The same concept applies to gene targeting vectors in which 
1 exons arc being deleted and replaced by the selectable marker. Deletion of an 
I exon or group of exons with a unit number of oodons may also result in a 
: functional protein product with an in-frame deletion. For most purposes it is 
\ advisable to delete portions or all of the target gcrte so that the genetic 
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Joyner, Gene Targeting: A Practical Approach, Oxford University Press 2000) 
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Production of targeted embryonic 
stem cell clones 

MICHAEL P. M ATI Sis, WOJTEK AUERBACH and ALEXANDRA L. 
JOYNER 

1, Introduction 

The discovery that cloned DNA introduced into tissue culture cells can 
undergo homoJogous recombination at specific chromosomal loci has 
revolutionized our ability to study gene function in celt culture and in vivo. In 
theory* this technique, termed gene- targeting* allows one to generate any type 
of mutation in any cloned gene. The kinds of mutations that can be created 
include null mutations, point mutations, deletions of specific functional 
domains, exchanges of functional domains from related genes, and gain-of* 
function mutations in which exogenous cDNA sequences arc inserted 
adjacent to endogenous regulatory sequences. In principle , such specific 
generic alterations can be made in any cell line growing in culture. However, 
not aU cell types can be maintained in culture under the conditions necessary 
for (ransfection and selection. Over ten years ago, pluripotcnt embryonic 
stem (ES) cells derived from the inner cell mass (ICM) of mouse blastocyst 
stage embryos were isolated and conditions defined for their proposition and 
maintenance in culture (1, 2). ES cell* resemble ICM cells tn many respects, 
including their ability to contribute to ail embryonic tissues b chimeric mice. 
Using stringent culm re conditions, the embryonic developmental potential of 
ES cells can be maintained following genetic manipulations and after many 
passages in vitro. Furthermore, permanent mouse tines carrying genetic alter- 
ations introduced into ES cells can be obtained by transmitting the mutation 
through *hc gcrmlmc by generating ES cell chimeras (described in Chapters 4 
and 5). Thus, applying gene targeting technology to ES oclls in culture affords 
researchers the opportunity to modify endogenous genes and study their 
function in vivo. In initial studies, one of the main challcmges of gene targeting 
was to distinguish the rare homologous recombination events from more com- 
monly occurring random integrations (discussed in Chapter t). However, 
advances tn cell culture and in selection schemes,, in vector construction using 
isogenic DNA, and in the application of rapid screening procedures have 
made it possible to identify homologous recombination events efficiently. 
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Crawley, What's Wrong With My Mouse Behavioral Phenotyping of Transgenic and 
Knockout Mice, p.l, rear cover, Wiley-Liss 2000) 
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Preface 



Turned mutation of genes expressed m the nervous system is on exciting new research 
field tha! is forging a remarkable a mail gam of molecular genetics and behavioral ncuro- 
science. My laboratory in Bethesda has been ihe fortunate recipient of visits from many 
molecular geneticists over Ihe past five years, who come to ask, "What 1 * wrong with my 
mouse? Can you tell us what behaviors are abnormal in our null mutants? And how do you 
measure behavior, anyway?" 

We have had some remarkable opportunities to collaborate with outstanding molecular 
geneticists in the National Institutes of Health Intramural Research Program and through- 
out live world ofi ifivcstigatiow of the behavioral effects of mutations id genes expressed Ln 
the mouse brain. Each of these collaborations has been a learning experience, increasing 
our understanding of the optimal experimental design for analyzing behavioral phenotypes 
of mutant mice, What arc the best testa to address each specific hypothesis? Which meth- 
ods work best for mice? Which rat tasks can be adapted for mice? What are the correct con- 
trols? What are the hidden pitfalls, lurking artifacts, false positives, and false negatives? 
Which statistical tests are most sensiti ve for detection of Ihe genotype effect? What is the 
minimum member of animals necessary for each genotype, gender, and age? Our laboratory 
and many others arc gradually working out the best methods for behavioral pbenotyping of 
transgenic and knockout mice. 

In the same conversations, molecular geneticists frequently asked me to recommend a 
book the)- could consult to learn more about behavioral tests fix mice. Apparently the sci- 
entific hook publishers arc receiving simitar queries. Ann Boyle and Robert Harirtgton at 
John WUcy & Sons, convinced of a real need for such a book, sweet-talked me bio filling 
the void. What's Wrong Wiih My Mouse? is written for these moncering molecular geneti- 
cists, and for the talented students who will be the next driving force in moving the field 
forward. 

On a personal level, 1 would like to express deep appreciation to all of my behavioral 
neuroscten&t colleagues around the world for their outstanding work, past, present and fu- 
ture. Your contributions to the excellence and abundance of mouse behavioral tests provide 



the foundation for the mjwdJy expanding scientific discoveries forthcoming from behavioral 
phenoryphnj studies of transgenic and knockout mice. This book is a testament to your ac- 
complishments. 



Jacqueline N. Crawuy, PhD. 

Cktvy Chair. Mary W 
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Designer Mice 



The disease is inherited. Family pedigrees indicate an autosomal dominant gene. Linkage 
analyses reveal one strongly associated chronbosomal locus. Mapping identifies the gene. 
The cDKA for the gene is sequenced. The anatomical distribution of the gene is primarily 
in the brain. The symptoms of the disease ore primarily ocuropsychiatric. There is no treat- 
ment for the disease. The disease is lethal. 

Your mission, should you choose to accept it, is to develop a treatment for the disease. 
Replacement gene therapy is the best hope. But you don't know the gene product, you 
donH know sis function, and you don't kaow if gece delivery would be therapeutic. Where 
do you start? 

These (fays, you may choose to start with a targeted gene rnuuuioix to generate a mutant 
mouse model of the hereditary disease. A DNA construct containing the mutated form of the 
responsible gene is developed. Tne construct as inserted into the mouse genome. A line of 
mice with the mutated gene is generated. Oiaxucteristics of the mutant mice arc identified in 
comparison to normal controls. Salient characteristics relevant to the human disease are quan- 
titatcd. These diseaselike traits arc then used as. test variables for evaruatmg the effectiveness 
of treatments. Putative treatments are administered to the mutant mice, A treatment that pre- 
vents or reverses the disease traits in the mutant mice b taken forward for further testing as a 
potential thetaneii^ Oene therapy, based on targeted 

gene replacement of the missing or incorrect gcoe in the human here4rtary disease, is de- 
scribed in Chapter 12. mmcruture,medkrjicrnayshift 

to admmfatering replacement genes that ctTectrvcry and permanently cure the disease. 

Targeted gene mutation in mke represents a new technology that is revolutionizing bio- 
medical research. Transgenic mice have an extra gene added. An additional copy of a nor- 
mal gene is inserted into the mouse genome to- study overexprcssicra of the gene product Or 
a new gene is added that is not normally present in the mouse genome. The new gene may 
be the aberrant form of a human gene linked to a disease. For example, the mutated form of 
the human huruingitn gene is added to the mouse genome to generate a mouse model of 
Huntington's disease. Knockout mice have a gene det&td- The null mutant homozygous 

T 
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knockout moose is deficient in both alleles of a gene; the heteroxygotc is deficient in one 
of its two alleles for the gene. The genotype is for the null mutant, +/- for the het- 
erozygotc, and +/+ lor the wlldlvpe normal control. The phenotype is the set of ob- 
served characteristics resulting from the mutation. Ptic do types include biochemical, 
anatomical, physiological and behavioral characteristics. 

Targeted mutations of genes expressed i n the brain arc rcxealing the mechanisms under- 
lytofl normal behavior and behavioral abnormalities. Moose models of human neuropgy- 
chiatrlc diseases, such as Alzheimer's disease* Parkinson* disease, Huntington s disease, 
amyotrophic lateral sclerosis, obesity, anorexia, depression, alcoholism, drug addiction, 
schizoprm*ma, and anxiety, arc likely to be characterized by their behavioral ptaertotype. 

This book is designed to introduce the novice to the rich literature of behavioral tests in 
mice and to show how to optimize me application of these tests for behavioral phenotyping 
of mutant mke, Based on our experiences,, our laboratory is working toward a unified ap- 
proach for the optimal conduct of behavioral phenotyping experiments in mutant mice. 
Recommendations are offered for a three-tiered sequence of behavioral teste, applicable to 
each behavioral domain relevant to genes expressed in the mammalian brain. 



SCOPE 

This book is designed as an overview of the mutant mouse technology and an mtroducion 
to the field of behavioral ncutoscience, as it can be applied to behavioral phenotyping of 
transgenic and knockout mice. Molecular geneticists may browse through the chapters rel- 
evant to their gene, to get ideas for possible tests to try Behavioral neunwetcntists who have 
no experience with mutant mice may wish to read about the methods for developing a trans- 
genic or knockout the behavioral tests that have been effectively applied and some of the 
successful experiments published in the genetics literature.. 

Chapters are organized around behavioral domains, including general health, neurolog- 
ical reflexes, developmental milestones, mote* functions, seftsory abilities, learning and 
memory, feeding, sexual and parental behaviors, social behaviors, and rodent paradigms 
relevant to fear, anxiety, depression, schizophrenia, reward, and drug addiction. Each chap- 
ter begins with a brief history of the early work in the field and the present hypotheses about 
mechanisms underlying the expression of the behavior. A list of general review articles and 
books is offered for each topic, encouraging the interested reader to gain more in-depth 
knowledge of the relevant literature. 

Standard tests arc men presented in detail. Highlighted are those tasks that have been ex. 
tensiwery validated in mice. Demonstrations of genetic components of task performance are 
described, including experiments comporing inbred strains of mice (strain distributions), 
Quantitative trait loci approaches (linkage analysis), and naturally occurring mutants (spon- 
taneous mutations). Experimental design and specific behavioral tasks are presented as 
simply as possible. Extensive references are included for each behavioral test to obtain 
more complete methods from the primary experimental literature on the lopkk 

Illustrations are provided for the most frequently used behavioral tasks. Photographs of 
the equipment or diagrams of the task accormpany the text. Samples of data are shown. The 
data presentation is designed to intficare the qualhative and quantitative results that can be 
expected when the task is properly conducted, 

Each chapter includes the results of several representative experiments in trtucb these 
tasks are success fairy applied to characterise transgenic and knockout mice. Examples arc 
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Transgenic and knockout mutations provide an important means for understanding gene j 

function, as wet! as for developing therapies for genetic diseases. This engaging and informative : K 

book discusses the many advances in the field of transgenic technology that have enabled : [ 

researchers to bring about various changes in the mocuse genome. Equal emphasis Is given to '. I 

both the principles of transgenic and knockout methods and ttieir applications. A clear and 1 t 

concise format provides researchers with a comprehensive review of the behavioral paradigms • 

appropriate for analyzing mouse phenotypes. \ 

What's Wrong wilh My Mouse? explains the differences between transgenic knockout mice and f 

their wild-type controls, while providing critical information about gene function and expres- I f 

sion. This volume recognises that newly identified genes Can provide useful insights into brain : \ 

functioning, including brain malfunctioning in disease states. Written by a world-renowned ' S : 

expert In the field, the material also covers: ' 1 

• How to generate a transgenic or knockout mouio 

• Motor functions (open field, holeboard, rota rod, balance, grip, circadian activity, etc.) 

• Sensory abilities (olfaction, vision, hearing, taste. tOOJCh. nociception) 

• Reproductive behavior, social behavior, and emotional behavior : j 

; j 

Researchers rn neuroscience. pharmacology, genetics, developmental biology, and cell biology I 

will all find this book essential reading. j 
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Doetschman, Laboratory Animal Science 49:137-143, 137 (1999) 
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Special Topic Overview 

Interpretation of Phenotype in 
Genetically Engineered Mice 



Thomas Doetschman 

Background and Purpose: In mice, genetic engineering involves two general approaches— addition of an ex- 
ogenous gene, resulting in transgenic mice, and use of knockout mice, which have a targeted mutation of an 
endogenous gene. The advantages of these approaches is that questions can be asked about the function of a 
particular gene in a living mammalian organism, taking into account interactions among cells, tissues, and 
organs under normal, disease, injury, and stress situations. 

Methods: Review of the literature concentrating principally on knockout mice and questions of unexpected 
phenotypes, lack of phenotype, redundancy, and effect of genetic background on phenotype will be discussed. 

Conclusion: There is little gene redundancy in mammals; knockout phenotypes exist even if none are imme- 
diately apparent; and investigating phenotypes in colonies of mixed genetic background may reveal not only 
more phenotypes, but also may lead to better understanding of the molecular or cellular mechanism underly- 
ing the phenotype and to discovery of modifier gene(s). 



One often hears the comment that genetically engineered 
mice, especially knockout mice, are not useful because they 
frequently do not yield the expected phenotype, or they dont 
seem to have any phenotype. These expectations are often 
based on years of work, and in some instances, thousands of 
publications of mostly in vitro studies. Examples of unex- 
pected phenotypes, based largely on experience with trans- 
forming growth factor beta (Tgfb) and basic fibroblast 
growth factor (Fgf2) knockout and transgenic mice, will be 
presented to discuss possible reasons for unexpected knock- 
out phenotypes. The conclusions will be that the knockout 
phenotypes do, in fact, provide accurate information con- 
cerning gene function, that we should let the unexpected 
phenotypes lead us to the specific cell, tissue, organ culture, 
and whole animal experiments that are relevant to the func- 
tion of the genes in question, and that the absence of pheno- 
type indicates that we have not discovered where or how to 
look for a phenotype. 

Before entering into how one should interpret unexpected 
knockout phenotypes and how one should deal with lack of 
knockout phenotypes, it is necessary to give a brief introduc- 
tion into how knockout mice are made. For detailed informa- 
tion, the following reviews are suggested (1-4). Transgenic 
technology has had a long history; thus, an introduction to 
that technology will not be given here. Rather, the following 
reviews are suggested (5, 6). At this juncture, it should be 
noted that, although transgenic vertebrates ranging from 
fish to bovids have been produced, knockout technology has 
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to date been successful only in mice, even though embryonic 
stem (ES) cells have been produced from several other spe- 
cies, including hamster (7), rat (8), rabbit (9, 10), pig (11-13), 
bovine (14, 15), and zebrafish (16). Consequently, the entire 
discussion will be focused on mice. 

Knockout mice are generated by the injection of geneti- 
cally engineered or gene-targeted ES cells into a mouse blas- 
tocyst to generate a chimeric embryo, which in turn can pass 
on the engineered gene to its offspring. ES cell lines are es- 
tablished from the inner cell mass of a mouse blastocyst, so 
that when injected into blastocysts, the ES cells can incor- 
porate into the inner cell mass of the recipient blastocysts, 
thereby chimerizing them. Subsequent to transfer of the chi- 
meric blastocysts into uteri of pseudopregnant mice, chi- 
meric mice are born. If the germline of a chimeric mouse is 
colonized by cells derived from the injected ES cells, the chi- 
mera is termed a "germline" chimera. Some of the offspring 
of the germline chimeras will then carry the engineered 
gene in their genomes. Gene targeting in ES cells uses the 
ES cells' DNA repair apparatus to bring about homologous 
recombination between an exogenous DNA fragment trans- 
fected into the ES cell and its homologous region in the ge- 
nome. Homologous recombination usually results in 
replacement of the endogenous region with the exogenous 
fragment, thereby altering the endogenous gene in a 
prespecified manner. There are many variations on this pro- 
cedure by which genes can be altered not only to ablate func- 
tion, but also to make more subtle mutations (17-19). Such 
procedures can be used to introduce point mutations, re- 
move specific splicing products, switch isoforms, and human- 
ize genes. In addition, technology has recently been 
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developed to make conditional and inducible knockouts in 
which gene function is ablated either in a developmentally 
specified tissue (20-22) or in an inducible manner (23-26). 
These techniques, though exciting, will not be further dis- 
cussed • 

Extensive nonredundancy in the TGFp family: Sev- 
eral thousand cell culture studies on the three mammalian 
transforming growth factor beta proteins (TGFps 1, 2, and 3) 
have implicated these growth and differentiation factors in 
the function of nearly every cell type studied. Expression 
studies indicated unique and overlapping expression of the 
three TGFps (27, 28). For example, overlapping protein local- 
ization was found in all gut epithelia, all layers of the skin, all 
three muscle types, kidney tubules, lung bronchi, cartilage, and 
bone (Table 1). Together with the fact that all three TGFps sig- 
nal through a common TGF type-II receptor (Figure 1), these 
data strongly suggest considerable redundancy in function. 
Consequently, it is surprising that, of the >30 phenotypes of the 
three Tgfb knockout mice that we have described (29-31), none 
appear to be overlapping (Table 2). These results indicate ex- 
tensive nonredundancy between TGFp ligands even though 
there is considerable overlap in expression. Of course, these re- 
sults do not rule out the possibility of some redundancy in some 
tissues. Combination of the ligand knockouts would uncover 
such situations, and it is likely that a few will exist, but 30 non- 
overlapping phenotypes for three ligands strongly suggests 
that a vast number of their functions are not redundant. 

There are several possible explanations for how there can 
be so much overlap in ligand expression and yet so much 
specific ligand function. First, TGFps are secreted as latent 
peptides and must be activated before they can bind recep- 
tors (32-35). The mechanism by which this extracellular 
processing occurs is not well understood and may be differ- 
ent for each TGFp. Hence, ligand processing presumably de- 
termines some functional specificity for the three TGFps. 
Second, there is a third type of TGFp receptor, TGFpR3, that 
can interact with ligand and receptor types I and II before 
cytoplasmic signaling can occur, though involvement of 
TGFPR3 is not essential for signaling (36-38). Association 
with type III receptors is thought to enhance some TGFpRl 
and 2/ligand interactions. Upon ligand binding, the serine/ 
threonine receptor TGFpR2 then associates with and phos- 
phorylates the transmembrane serine/threonine receptor 
TGFpRl, which in turn initiates a phosphorylation-medi- 
ated signaling cascade. Hence, combinatorial receptor/ligand 
interactions will also determine functional specificity Third, 
signaling from TGFpRl can occur through two cytoplasmic 
signaling proteins called SMAD2 and 3 (39, 40) and, per- 
haps, through a third called SMAD5 (41). In addition, 
SMAD6 and 7 can also interact with the other SMADs to in- 
hibit signaling (42-44). Hence, differential SMAD protein 
interactions with transcriptional machinery will probably 
also determine functional specificity for the three TGFp 
ligands. Finally, there may be several non-transcriptional 
signaling pathways for TGFps. For example, we have found 
that TGFpi-deficient platelets from Tgfbl knockout mice 
have impaired platelet aggregation that can be restored by 
incubating isolated platelets with recombinant TGFpi (un- 
published observations). Because platelets do not have a 
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The polyclonal antiooaies usea were spetuiu lcaiuu " u 1 — 

and 2 and residues 9-20 of TGF(33. The avidin-biotin system was used 
for staining. Data obtained from immunohistochemical study of Pelton 
et al. (28). Reproduced from The Journal of Cell Biology, 1991,115:1091- 
1105, by copyright permission of The Rockefeller University Press. 

nucleus, there must exist a signaling pathway that is 
nontranscriptional. In summary, given the complexities of 
ligand processing, receptor interactions, and signaling path- 
ways, it becomes clear why redundancy in TGF1, 2, and 3 
function has not been detected at the whole animal level, 
even though there is considerable overlap in expression of 
Tgfb gene family members. Consequently, if other gene fami- 
lies function with similar complexity, it is likely that, in the 
final analysis, little functional redundancy will be found 
within gene families. 

Two striking examples of apparent functional redundancy 
are worth considering. The first involves myogenic genes, 
and the second involves retinoic acid receptors. Contrary to 
early interpretations, redundancy does not now appear to be 
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Figure 1. TGFp signaling pathway. The TGFp ligands, TGFpl (pi), TGF02 (p2), and TGFp3 ((33), exist primarily in a latent form in vivo 
and are activated by mechanisms not yet clear. In general, TGFp 2 interacts with a TGFp type III receptor (RIII) before interaction with 
TGFp type II (RID and TGFp type I (RI) receptors; whereas, the TGFpl and TGFp3 ligands can interact directly with the type II receptor. 
The ligand receptor complexes can then associate with several cytoplasmic molecules, farnesyl protein transferase (FPT) and FK506 
binding protein-12 (FKBP-12), being two potential examples. The receptor-ligand complex signals to the nucleus through threonine/ 
serine phosphorylation of a series of SMAD proteins (related to the Drosophila "mothers against decapentaplegic" protein) which then 
elicit trans ciptional regulation of extacellular matrix, cell cycle, differentiation and growth factor receptor genes. The roles of the associ- 
ated cytoplasmic molecules FPT and FKBP-12 are not clear but are thought to involve RAS pathway signaling and modulation of signal- 
ing through the SMAD proteins. 



the case for two of the myogenic genes known to be essential 
for specification of vertebrate skeletal muscle, Myod and 
MyfS. Even though the individual knockouts have muscle, 
and only the combined knockouts do not have muscle (45), it 
is now clear that each gene functions in the specification of 
distinct muscle cell lineages. Consequently, in the absence of 
one source of muscle cells, the other source may compensate 
for that (46, 47). This should be termed developmental com- 
pensation, rather than gene redundancy. On the other hand, 
with respect to retinoic acid receptors, there is also good evi- 
dence for functional redundancy. Similar to the myogenic 
genes, retinoic acid receptor gene knockout mice have few 
phenotypes, whereas the combined knockouts have many 
phenotypes (48, 49). Whether this turns out to be gene re- 
dundancy or another case of developmental compensation 
remains to be determined. 



Lack of phenotype: As is the case for TGFp, there also is 
a multitude of reports indicating that the FGFs 1 and 2 have 
important roles in numerous cell types and tissues. Conse- 
quently, when the Fgf2 gene was knocked out by gene tar- 
geting, it was quite surprising that there was no obvious 
phenotype (50). The Fgf2' h animals live a long, healthy life, 
and fertility and fecundity are normal. Even the pituitary 
gland, which is the best source of FGF2, appears not to have 
morphologic defects. The only evidence for any developmen- 
tal abnormalities is found in hematopoiesis (50), where 
blood platelet counts are high, and in the cerebral cortex (51, 
52), where morphometric analysis reveals decreased cell 
density. Clearly, these abnormalities are minor, compared 
with expectations. This was all the more evident because our 
transgenic mice, in which the human FGF2 gene was ubiq- 
uitously overexpressed by the phosphoglycerate kinase pro- 
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Table 2. Nonoverlapping phenotypes of Tgfbl, 2, and 3 knockout mice 
and the penetrance of those phenotypes 



Knockout mouse phenotype 



Penetrance (%)* 



Tgfbl 

Embryo lethalities 

Preimplantation lethality 

Yolk sac lethality 

Adult phenotypes 

Multifocal autoimmunity 

Platelet defect 

Colon cancer 

Failing heart 
Tgfb2 (all perinatal lethalities) 
Heart defects 

Ventricular septum defects 

Dual outlet right ventricle 

Dual inlet left ventricle 
Inner ear defect— lacks spiral limbus 
Eyes 

Ocular hypercellularity 
Reduced corneal stroma 
Urogenital defects in kidney 
Dilated renal pelvis 
Agenesis (females only) 
Uterine horn ectopia 
Testicular ectopia 
Testis hypoplasia 
Vas deferens dysgenesis 
Lung-postnatal 

Dilated conducting airways 

Collapsed bronchioles 
Skeletal defects 

Occipital bone 

Parietal bone 

Squamous bone 

Palatine bone (cleft palate) 

Alisphenoid bone 

Mandibular defects 

Short radius and ulna 

Missing deltoid tuberosity and third trochanter 
Sternum malformations 
Rib barreling 
Rib fusions 
Spina bifida 
Tgfb3 (perinatal lethality) 
Cleft palate 



50 

50 

50 b 

50 
100 c 
100 ed 
100 ed 
100" 1 

100 

94 

19 

25 
100 

100 
100 

30 
20 
40 
100 
20 
20 

100 
100 

100 
100 
100 
22 
100 
100 
100 
94 
25 
94 
13 
100 

100 



"See Table 3 for background dependency of Tgfbl knockout phenotypes. 
described in references 64, 67. , * ■ * u ^u 

'Refers to percentage penetrance among animals that survive to birtn. 
d Unpublished observations. , . T . of 

Details on the remaining phenotypes can be found in the text and in ref- 
erences 29-31, 63. 

moter (53), had very short legs, suggesting an important role 
of FGF2 in bone development, yet the bones of the knockout 
animals were normal. This apparent discrepancy between 
the transgenic and knockout mice indicates that some other 
FGF signals through the same FGF receptor as does FGF2, 
and that this other FGF is the true ligand that is important 
in bone development. Another possibility is that there is "de- 
velopmental compensation" by alternative mechanisms. In 
other words, the absence of FGF2 may cause developmental 
abnormalities during bone development that are then com- 
pensated for by another developmental pathway. This alter- 
native would not necessarily require a different FGF to be 
involved. 

After we had made our first analysis of the Fgf2 knockout 
mouse and did not find an obvious phenotype, it was easy to 
explain the "lack of phenotype" by invoking redundancy be- 
cause there are at least 18 known Fgf genes. But in hind- 
sight, it now appears more likely that all members of this 
large gene family have specific functions, even though they 



signal through receptors encoded by only four receptor genes 
(54) In Fgf2 knockout mice, evidence was not found for up- 
resulation of the two ligands most structurally related to 
FGF2 namely, FGFs 1 and 5 (50). Also, genetic combination 
of Fgf2 and Fgf5 (50) did not reveal redundancy between 
these similar genes. In addition, further analysis of the 
mice revealed roles being played in hematopoiesis and 
vascular tone control (50) as well as in brain development 
and wound healing (51, 52). Finally, in addition to Fgf2, 
Fgfs3-b 7 8 also have been ablated by gene targeting, re- 
vealing functions in proliferation of the inner cell mass 
(Fgf4) (55); gastrulation and cardiac, craniofacial, fore- 
brain, midbrain, and cerebellar development (Fgf8) (56); 
brain and inner ear development {Fgf3) (57, 58); and two 
aspects of hair development (FgfZ and 7) (59, 60). lb date 
comparison of Fgf knockout phenotypes from 6 of the 18 
Fgf genes has not turned up overlap except possibly m the 
cerebellum. Together, these results indicate that each 
gene has important unique functions. Although a few re- 
dundant functions may eventually be found on combina- 
tion of Fgf2 with all other Fgfs except Fgfr it is clear that 
6 of the 18 Fgf genes studied by gene targeting have been 
associated with essentially unique knockout phenotypes. 

lb summarize, what originally appeared as "lack of pheno- 
type" led many of us to the premature conclusion that other 
FGFs must have functions redundant to those of FGF2. 
However, further analysis of Fgf2 knockout mice has since 
revealed a wealth of unique functions ranging from thromb- 
ocytosis and vascular tone control (50) to brain development 
and wound healing (51, 52). It is my expectation that further 
physiologic analysis of the Fgf2 knockout mouse will reveal 
functions in the hypertrophic response to hypertension and 
responses to ischemia/reperfusion injury and bone injury. In 
the final analysis, it is likely that the major roles of FGF2 
may have less to do with getting us to birth than with keep- 
ing us alive after birth, whereas several other FGFs clearly 
have developmental roles. 

Effects of genetic background on phenotypic varia- 
tion: From 100 years of mouse genetics, it has become clear 
that genetic background plays an important role in the sus- 
ceptibility of mice to many disorders. Therefore, the pheno- 
types of knockout mouse strains will also have genetic 
background dependencies, as was first documented by the 
Magnuson and Wagner groups (61, 62). The Tgfbl knockout 
mice are an exceptional case in point (Table 3). On a mixed 
(50-50) 129 x CF1 background (CF1 is a partially outbred 
strain), about half of Tgfbl knockout mice die from a preim- 
plantation developmental defect (63), and the other half die 
of an autoimmune-like multifocal inflammatory disease at 
about weaning age (29). If the targeted Tgfbl allele is back- 
crossed onto a C57BL/6 background, 99% of all knockout 
animals die of the preimplantation defect(63). However^ if a 
Tgfbl knockout allele is put onto a mixed 129 x NIH/Ola x 
C57BL/6 background, embryo lethality is observed during 
yolk sac development, not during preimplantation develop- 
ment (64). With respect to the multifocal inflammatory dis- 
order of Tgfbl knockout mice, if the targeted allele is put 
onto a 129 x CF1 mixed background (50:50), severe inflam- 
mation exists only in the stomach (29); on the mixed 129 x 
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Table 3. Background dependency of Tgfbl knockout phenotypes 



Phenotype penetrance on various strains (%) 



Phenotype 



129 x CF1 


129 x C57 


129 x C3H 


C57 


129 


C3H 


50 


ND 


ND 


99 


ND 


ND 


0 


0 


ND 


ND 


ND 


ND 


50 


50 


50 


1 


ND 


ND 


90 l> 


20 b 


ND 


ND 


ND 


ND 


0 


70" 


ND 


ND 


ND 


ND 


ND 


ND 


ND 


ND 


100 


0 



129xC57x NIH/Ola 



Preimplantation lethality 

Yolk sac lethality" 

Autoimmune disease 
Gastric inflammation 
Intestinal inflammation 

Colon cancer* 



0 
50 
50 
ND 
ND 
ND 



Percentage of knockout animals of a given strain that have the designated phenotype. 
"For details, see references 64, 67. 

'Approximately 10% of animals with autoimmune disease have no detectable gastrointestinal tract inflammation. 
Unpublished observations. 
ND = not determined. 



NIH/Ola x C57BL/6 background, the intestines are more se- 
verely inflamed than is the stomach (65). Finally, on a pre- 
dominantly 129 background (129 x CF1; -97:3), Tgfbl 
knockout mice develop colon cancer if the inflammatory dis- 
order can be eliminated by other genetic manipulations that 
render the mice immunodeficient (unpublished observa- 
tions). However, on a predominantly C3H background, im- 
munodeficient Tgfbl knockout mice do not develop colon 
cancer (66). These results suggest that modifier genes exist 
that can significantly affect the function of TGFpi in preim- 
plantation development, yolk sac development, bowel and 
gastric inflammation, and colon tumor suppression. 
Progress toward localizing a modifier gene for the yolk sac 
developmental problem has been made (67). 

What is the best genetic background for knockout 
mice? Because background-dependent phenotypic variabil- 
ity will likely be found for most knockout mice, it will be use- 
ful to backcross a targeted allele onto several mouse 
backgrounds to make congenic strains. In this section, it will 
be argued that putting a targeted allele on a mixed strain 
background will also provide useful information. This is not 
to say that congenic strains are not useful. Rather, the point 
to be made here is that there also are benefits to looking at 
mixed strain backgrounds. Again, our experience with Tgfb 
knockout mice will be instructive. 

Generating homozygous mutant knockout animals on a 
mixed genetic background is faster. The ES cells are nearly 
always from a 129 strain, and the blastocysts into which the 
targeted ES cells are injected are nearly always C57BL/6. 
For reasons unknown, this is a good combination for estab- 
lishing germline transmission of the injected ES cells. The 
resulting chimeras can then be crossed with any strain de- 
sired, but 129, C57BL/6, or Black Swiss mice are most often 
used, and CF1 mice were used in the case of our Tgfbl 
knockout mice. Heterozygous offspring from this crossing 
will then be inbred 129 or Fl hybrids of 129 and one of the 
other strains. Clearly then, the quickest route to having the 
knockout allele on an inbred strain is through 129. For the 
other strains several generations of backcrossing is re- 
quired, which can take well over a year. Unfortunately, 
strain-129 mice have low fertility and fecundity. Conse- 
quently, the number of offspring per litter is usually fewer 
than six. Although 129 x C57BL/6 hybrids are more robust, 
upon backcrossing onto C57BL/6, litter size decreases, lb 
the contrary, the Black Swiss and CF1 strains are robust, 
and fitter size often is in excess of 12. The reason for this is 
probably because they are not truly inbred strains, but 



rather are partially outbred through random breeding 
within their respective strains. Therefore, one of the choices 
one has is to stay with "pure" genetics at the expense of a 
lower production rate and considerable delay before genera- 
tion of experimental animals, or sacrifice some genetic pu- 
rity to obtain a more efficient production colony. Ideally, one 
would want to do both, but this often is too expensive. 

Mixed genetic background knockout mice often have a 
wider range of phenotypes. The Tgfbl knockout mice back- 
crossed onto either the 129 or C57BL/6 background 
(congenics) yield only embryo lethality (63, unpublished ob- 
servations). On the other hand, when the knockout allele is 
maintained on mixed genetic backgrounds, embryo and 
adult phenotypes are maintained. 

The Tgfb2 & Tgfb3 knockout mice provide further ex- 
amples. The Tgfb2 knockout mice have more than two dozen 
congenital defects and die either immediately preceding or 
during birth, or within 2 h thereafter (30). Table 2 indicates 
that most of the phenotypes are only partially penetrant. 
Though it is not documented, it is likely that the penetrance 
of some of these phenotypes would increase to nearly 100%, 
and some of the other phenotypes would disappear were we 
to put the Tgfb2 knockout allele on inbred backgrounds. 
Hence, the mixed strain background probably provided more 
information than would congenic strains. 

The Tgfb3 knockout mice have a cleft palate (31). One 
colony of Tgfb3 knockout mice was left as a mixed back- 
ground (129 x CF1; 50:50) strain, whereas another colony 
was backcrossed several generations to the C57BL/6 strain. 
These two colonies had considerable expressivity differ- 
ences; the inbred colony had more severe clefting than did 
the mixed background colony. In the latter, expressivity of 
clefting varied widely from animal to animal. This variable 
expressivity within the mixed background colony provided 
us with the opportunity to obtain far more data on develop- 
ment of the cleft palate and was, therefore, more useful for 
making assumptions about the cellular and molecular 
mechanisms by which TGF03 supports palate fusion. Hence, 
using the Tgfb3 knockout mice, the mixed strain background 
provided more information than did the congenic strain. 
Consequently, a wider range of penetrance and expressivity 
of phenotype is a major advantage of investigating knockout 
phenotypes in mixed background colonies. Further, variable 
penetrance of phenotype in a mixed background colony sug- 
gests that there are modifier genes for each phenotype that 
could be obtained by linkage studies. 
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Conclusions 

Questions have been addressed that arose from the last 8 
years in which knockout mice have been investigated to ana- 
lyze gene function at the whole animal level. These questions 
concern gene redundancy, apparent lack of phenotype in a sur- 
prising number of knockout strains, and effects of genetic back- 
ground on knockout phenotype. Using data obtained 
principally from Tgfb and Fgfknockout mice, it is argued that 
there is probably'little redundancy in the genome (i.e., that few 
genes are dispensable for survival of the species). Apparent 
lack of phenotype more likely reflects our inabihty to ask the 
right questions, or our lack of tools to answer them, than it does 
a true lack of function. Finally, discussion of genetic back- 
ground phenotype variability, including variable penetrance 
and expressivity, was used to present some of the advantages of 
working with mixed genetic background colonies of knockout 
mice. For all the examples given here, there are counter ex- 
amples that must be taken seriously; consequently, these argu- 
ments must not be taken as absolutes. For example, if a gene in 
a particular mouse strain has recently been duplicated, it will 
most likely be redundant. If one is studying tissue rejection in 
a knockout mouse, the genetic background obviously must be 
well defined and preferably inbred. Or, if one wants to use the 
susceptibility of a particular mouse strain to cancer to investi- 
gate the function of the knockout gene in progression of that 
cancer, the knockout allele must be put on that mouse strain. 
In general, however, when setting up approaches for investigat- 
ing a new gene knockout mouse, I believe one would be well 
advised to assume that: there is little gene redundancy in 
mammals; there are knockout phenotypes even if none are im- 
mediately apparent; and investigating phenotypes in mixed 
genetic background colonies may not only reveal more pheno- 
types, but may lead to better understanding of the molecular or 
cellular mechanism underlying the phenotype, and may lead to 
modifier gene discovery. 
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Researchers to Gain Wider Access to 
Knockout Mice 

Trans-NIH Effort Provides New Models for 
Understanding Human Disease 

BETHESDA, Md., Wed., Oct. 5, 




2005 - The National Institutes of 
Health (NIH) today announced 
contracts that will give researchers 
unprecedented access to two 
private collections of knockout 



mice, providing valuable models for the study of human disease 
and laying the groundwork for a public, genome-wide library of 
knockout mice. 



Under terms of three-year contracts jointly funded by 19 NIH 
institutes, centers and offices, Deltagen Inc. of San Carlos, Calif., 
and Lexicon Genetics Incorporated of The Woodlands, Texas will 
provide NIH and its scientific partners with access to extensively 
characterized lines of mice in which a specific gene has been 
disrupted, or "knocked out." In the first year of the contract, NIH 
will expend about $10 million to acquire about 250 lines of 
knockout mice. 



For each mouse line, the contractors will provide not only the 
mouse line itself, but also detailed, objective data on the impact 
of the specific gene deletion on the mouse's phenotype, which 
includes appearance, health, fitness, behavior, ability to 
reproduce, and radiological and microscopic data. Such 
comprehensive information on such a large group of mice has 
never been available to public sector researchers, and is expected 
to greatly accelerate efforts to explore gene functions in health 
and disease. 
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"Our decision to procure these knockout mouse lines and data 
and make them available to the research community will yield 
tremendous benefits, both in the short and long terms," said NIH 
Director Elias A. Zerhouni, M.D. "This trans-NIH initiative will 
place important mouse models into the hands of researchers, 
speeding advances in the understanding of human disease and 
the development of new therapies. It also represents a significant 
step in the direction of launching an international project to 
systematically knock out all genes in the mouse." 




Since the early 
1980s, when 
recombinant 
DNA 

technology was 
used to create 
the first such 
animals, 
knockout mice 
have proven to 
be one of the 
most powerful 

tools available to study the function of genes and to create 
mouse models of human disease. Researchers have produced 
knockout mice with characteristics similar to humans suffering 
from a wide range of disorders, including cancer, heart disease, 
neurological disorders and even obesity. 



A knockout moose model of obesity, +ef£, compared wtth a normal mouse- 
Photo courtesy of Lexicon Genetics Incorporated. 



The process used by NIH to select the mouse lines involved a 
rigorous scientific review process that evaluated information on 
the knocked out gene, the reliability of the method used to 
produce the knockout, and whether the mouse line possesses a 
"reporter" gene, which enables researchers to analyze the pattern 
of the knockout gene's expression in various mouse tissues. 



"This is exciting news for all researchers working to understand 
the complex underpinnings of human biology in health and 
disease. Knockout mice provide one of the quickest, most cost 
effective ways to explore gene function. It is essential that we 
make it possible for more researchers to tap into this power," 
said James Battey, M.D., Ph.D., director of the National Institute 
on Deafness and Other Communication Disorders. Dr. Battey 
serves as chair of the Trans-NIH Mouse Initiative, which develops 
priorities for mouse genomics and genetic resources at NIH. 



The new contracts provide NIH with irrevocable, perpetual, 
worldwide, royalty-free licenses to use and distribute to academic 
and non-profit researchers these lines of knockout mice. The 
mouse lines, which will be stored in the form of frozen embryos, 
frozen sperm and frozen embryonic stem (ES) cells, will be 
delivered to NIH-funded mouse repositories that supply mice to 
universities, medical schools and research labs all over the world. 
When researchers express interest in obtaining a certain 
knockout mouse line, the repositories will send them live mice, 
frozen embryos, sperm, and/or ES cells, so they can study the 
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mice in their own labs. All data on the mice will be made available 
to researchers worldwide without restriction in publicly available 
databases on the Web. 

Under the license agreements with Deltagen and Lexicon, 
researchers who receive the knockout mice lines are free to 
publish any results from research involving the line and also to 
seek patent or other intellectual property protection for any of the 
inventions or discoveries resulting from such research. 

The 19 NIH institutes, centers and offices contributing to the 
contracts are: National Center for Complementary and Alternative 
Medicine, National Center for Research Resources, National Eye 
Institute, National Human Genome Research Institute, National 
Heart, Lung and Blood Institute, National Institute on Aging, 
National Institute of Alcohol Abuse and Alcoholism, National 
Institute of Allergy and Infectious Diseases, National Institute of 
Arthritis and Musculoskeletal and Skin Diseases, National 
Institute of Child Health and Human Development, National 
Institute on Deafness and Other Communication Disorders, 
National Institute of Dental and Craniofacial Research, National 
Institute on Drug Abuse, National Institute of Environmental 
Health Sciences, National Institute of General Medical Sciences, 
National Institute of Mental Health, National Institute of 
Neurological Disorders and Stroke, National Institute of Nursing 
Research, and the Office of AIDS Research. 

In order to build upon the acquisition of knockout mice available 
from the private sector, the NIH in September issued a set of 
requests for applications to establish a Knockout Mouse Project. 
The ambitious goal of this trans-NIH program is to produce a 
comprehensive resource of mouse mutants in which every gene 
in the mouse genome has been knocked out. The resource will 
serve to further the value of the mouse as a powerful and 
important tool in the study of human health and disease. 

For more information on what knockout mice are, how they are 
made and what they are used for, go to 
www.qenome.gov/12514551 . To download a high-resolution 
photo of a knockout mouse, go to www. q enome. gov /17015128 . 

NIH is the steward of medical and behavioral research for the 
United States. It is an agency of the U.S. Department of Health 
and Human Services. For more information about NIH, go to 

www.nih. g ov . 

Contact: 

Geoff Spencer, NHGRI 
(301) 402-0911 
spencerg@maiLnih.gov 
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EX-99.2 4 dex992.htm DELTAGEN, INC. NEWS RELEASE DTD FEBRUARY 11, 2002 

Exhibit 99.2 

Deltagen and Merck Enter Into DeltaBase License 
Agreement 

REDWOOD CITY, Calif, Feb. 1 1/PRNewswire-FirstCall/ — Deltagen, Inc. (Nasdaq: DGEN) announced today that it has 
entered into a license agreement to provide Merck & Co., Inc. (NYSE: MRK) with access to Deltagen's proprietary 
DeltaBase™ product, a powerful resource tool for the understanding of in vivo mammalian gene function information. 

Merck will have non-exclusive access to information related to 750 genes selected for their biological interest that have been 
functionally characterized and entered into DeltaBase, Merck will also have access to certain of the corresponding DeltaBase 
intellectual property rights. Financial terms were not disclosed. "We are delighted to extend our relationship with Merck, 
which has a worldwide reputation in pharmaceutical discovery and development, by adding them as a DeltaBase subscriber," 
said William Matthews, Ph.D., president and chief executive officer at Deltagen. 

Through its proprietary product DeltaBase™, Deltagen provides pharmaceutical companies with critical information to better 
understand the in vivo function of mammalian genes, their relationship to other genes and the biochemical pathways for large 
segments of the genome. Each gene is the focus of an exhaustive investigation; more than 20,000 pieces of data from each 
gene are distilled into disease-relevant frameworks that include proprietary knockout mouse phenotypic, expression profile 
data and other proprietary target validation data. Information in DeltaBase is generated using Deltagen's large-scale mouse 
gene knockout technology and standardized phenotypic analysis protocols. 

Deltagen is a genomic-based biotechnology company headquartered in Redwood City, California, that provides data to 
pharmaceutical and biotechnology companies on the function, role and disease relevance of mammalian genes. This 
information may facilitate the discovery and validation of drug targets to advance the development of new genomic-based 
medicines. Deltagen's principal product, DeltaBase, provides a database of in vivo derived, mammalian gene function 
information. In addition, the company is dedicated to determine the function of secreted proteins and is undertaking the 
discovery and development of biotechnology drug candidates internally or in collaboration with other parties. Current 
DeltaBase subscribers can be found on Deltagen's website, 
www. deltagen. com. 

Except for the historical information contained herein, the matters set forth in this press release, including statements as to the 
role that Deltagen's DeltaBase product and gene function database information will play in third-party research programs and 
the extent to which genome-based research will assist researchers in their drug discovery efforts, are forward-looking 
statements within the meaning of the "safe harbor" provisions of the 
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Private Securities Litigation Reform Act of 1995. These forward-looking statements are subject to risks and uncertainties that 
may cause actual results to differ materially from those set forth in the forward-looking statements, including the extent to 
which genomic databases are utilized in pharmaceutical research and development; the ability of Deltagen to provide 
products and services that meet market needs; the impact of competition and alternative technologies, processes and 
approaches; and other risks cited in the risk factors sections of Deltagen's Annual Report on Form 10-K filed with the 
Securities and Exchange Commission and Deltagen's other securities filings with the Commission. These forward-looking 
statements speak only as of the date hereof. Deltagen disclaims any intent or obligation to update these forward-looking 
statements. 
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EXHIBIT K 



Declaration of Robert Driscoll, filed June 22, 2005 



7005,467 



PATENT 



Title: 



Filed: 



Applicant: 
Serial No.: 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

Keith D. Allen Examiner: Qian, Celine X 

10/005,467 Group Art Unit: 1632 

December 4, 2001 Docket No.: R758/75658.029500 

Transgenic Mice Containing PTP36 Tyrosine Phosphatase Gene Disruptions 



DECLARATION OF ROBERT DRISCOLL PURSUANT TO 37 C.F.R. § 1.132 



Commissioner for Patents 
P.O. Box 1450 
Alexandria, VA 22313-1450 



I, Robert Driscoll, residing at 23 Chicory Lane, San Carlos, CA 94070, hereby declare: 

1 . I am presently employed as Vice President of Intellectual Property & Legal 
Affairs at Assignee, Deltagen, Inc., in San Carlos, CA. I have also previously served as the 
Company's Senior Director of Intellectual Property, in which position I managed and oversaw 
the Company's intellectual property portfolio, including the Company's patent filings. I possess 
a Ph.D in Chemistry, received from the California Institute of Technology. I also possess a ID., 
received from Loyola Law School, Los Angeles. I am a registered patent attorney (Reg. No. 
47,536). 

2. I am familiar with the above-cited application. I am familiar with the Office 
Action mailed March 22, 2005. I am aware that the Examiner has rejected the claims, in part, for 
allegedly failing to meet the utility requirement. I am also aware that the Applicant has argued 
that a commercial sale of a mouse with a disrupted PTP36 tyrosine phosphatase allele within the 
scope of the claimed subject matter ( U PTP36 tyrosine phosphatase gene knockout mouse") 
should satisfy the utility requirement. 



Sir: 



3. In support of the Applicant's aforementioned argument, I hereby state that I have 
reviewed Deltagen' s internal sales records regarding the PTP36 tyrosine phosphatase gene 



n 



knockout mouse. According to these records, the PTP36 tyrosine phosphatase gene knockout 
mouse has been delivered to at least one (1) large pharmaceutical company. The contractual 
terms by which the mice were transferred prohibit Deltagen from identifying the name of this 
company. However, the company is ranked among the top 10 pharmaceutical companies 
worldwide (based on sales), 

4. It is my understanding, based on communications with our pharmaceutical 
company customers, that transgenic knockout mice obtained from Deltagen are used for studying 
gene function and for human therapeutic drug development. 

5. I further declare that all statements made herein of my own knowledge are true; 
and further that these statements were made with knowledge that willful false statements and the 
like so made are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of 
the United States Code, and that such willful false statements may jeopardize the validity of the 
above-referenced application or any patent issuing thereon. 




R&beifDriscoll, Ph.D, Reg. No. 47,536 



/4 ^21 



Date 
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The price of innovation: new estimates 
of drug development costs 

Joseph A. DiMasi 3 '*, Ronald W. Hansen b , Henry G. Grabowski c 

a Tufts Center for the Study of Drug Development, Tufts University, 192 South Street, 
Suite 550, Boston, MA 02111, USA 
William E. Simon Graduate School of Business Administration, University of Rochester, Rochester, NY, USA 
c Department of Economics, Duke University, Durham, NC, USA 

Received 1 7 January 2002 ; received in revised form 24 May 2002 ; accepted 28 October 2002 



Abstract 

The research and development costs of 68 randomly selected new drugs were obtained from a sur- 
vey of 1 0 pharmaceutical firms. These data were used to estimate the average pre-tax cost of new drug 
development. The costs of compounds abandoned during testing were linked to the costs of com- 
pounds that obtained marketing approval. The estimated average out-of-pocket cost per new drug is 
US$ 403 million (2000 dollars). Capitalizing out-of-pocket costs to the point of marketing approval 
at a real discount rate of 1 1 % yields a total pre-approval cost estimate of US$ 802 million (2000 dol- 
lars). When compared to the results of an earlier study with a similar methodology, total capitalized 
costs were shown to have increased at an annual rate of 7.4% above general price inflation. 
© 2003 Elsevier Science B.V. All rights reserved. 

J EL classification: L65; 031 

Keywords: Innovation; R&D cost; Pharmaceutical industry; Discount rate; Technical success rates 



1. Introduction 

Innovations in the health sciences have resulted in dramatic changes in the ability to treat 
disease and improve the quality of life. Expenditures on pharmaceuticals have grown faster 
than other major components of the health care system since the late 1990s. Consequently, 
the debates on rising health care costs and the development of new medical technologies 
have focused increasingly on the pharmaceutical industry, which is both a major participant 
in the health care industry and a major source of advances in health care technologies. 

* Corresponding author. Tel.: + 1-617-636-21 16. 
E-mail address: joseph.dimasi@tufts.edu (J.A. DiMasi). 

0167-6296/03/$ - see front matter © 2003 Elsevier Science B.V. All rights reserved, 
doi : 1 0. 1 0 1 6/S0 1 67-6296(02)00 1 26- 1 
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One of the key components of the discussion is the role of private sector pharmaceutical 
industry investments in R&D and an understanding of the factors that affect this process. 
Although the industry engages in many forms of innovation, in general the most significant 
is the discovery and development of new chemical and biopharmaceutical entities that 
become new therapies. Our prior research (DiMasi et al., 1991) found that the discovery 
and development of new drugs is a very lengthy and costly process. In the research-based 
drug industry, R&D decisions have very long-term ramifications, and the impact of market 
or public policy changes may not be fully realized for many years. From both a policy 
perspective, as well as an industrial perspective, it is therefore important to continue to 
analyze the components of and trends in the costs of pharmaceutical innovation. 

In this paper we will build on research conducted by the current authors (DiMasi et al., 
1991) and others on the economics of pharmaceutical R&D. As we described in our prior 
study, "Empirical analyses of the cost to discover and develop NCEs are interesting on 
several counts. First, knowledge of R&D costs is important for analyzing issues such as 
the returns on R&D investment. Second, the cost of a new drug has direct bearing on 
the organizational structure of innovation in pharmaceuticals. In this regard, higher real 
R&D costs have been cited as one of the main factors underlying the recent trend toward 
more mergers and industry consolidation. Third, R&D costs also influence the pattern 
of international resource allocation. Finally, the cost of R&D has become an important 
issue in its own right in the recent policy deliberations involving regulatory requirements 
and the economic performance of the pharmaceutical industry". In the decade that has 
followed the publication of our earlier study, these issues remain paramount. In addition, 
the congressional debates on Medicare prescription drug coverage and various new state 
initiatives to fill gaps in coverage for the elderly and the uninsured have intensified the 
interest in the performance of the pharmaceutical industry. 

In the current study we are not attempting to directly answer the policy debates men- 
tioned above. Rather, our focus is on providing new estimates of economic parameters 
associated with the drug development process. In particular, we concentrate on estimates 
of the costs of pharmaceutical innovation. Our prior estimates have been used by the Office 
of Technology assessment (OTA), the Congressional Budget Office (CBO), and various 
researchers to analyze policy questions such as the effects on R&D activities of health care 
financing reform or changes in intellectual property legislation related to the pharmaceutical 
industry. 

The approach used in this paper follows our previous study (DiMasi et al., 1991) and 
the earlier work by Hansen (1979). Given the similarity in methodologies, we are able 
to compare our results in the current study with the estimates in the earlier studies to 
illustrate trends in development costs. All three studies used micro-level data on the cost 
and timing of development obtained through confidential surveys of pharmaceutical firms 
for a random sample of new drugs first investigated in humans by these firms. In the current 
study, the new drugs were first tested in humans anywhere in the world between 1983 and 
1994. The reported development costs ran through 2000. Ultimately, we are interested in 
the expected cost of development per approved new drug. The uncertainties in the research 
and development process result in expenditures on many development projects that are not 
successful in producing a marketed product. However, to produce an estimate of expected 
cost for a marketed product, we must allocate the costs of the unsuccessful projects to those 
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that result in a marketed new product. The R&D process is lengthy, and as such it is important 
to know at what stage of development expenses occur. Viewed as an investment project, it 
is necessary to know both the amount of expenditures and the timing of these expenditures, 
since funds committed to R&D in advance of any returns from sales have both a direct 
and an opportunity cost. We used a unique database to estimate various cost parameters 
in the development process. Of particular concern is the estimation of the average pre-tax 
cost of new drug development, since we are interested in the resource costs of new drug 
development and how they have changed over time. 

1.1 . Previous studies of the cost of pharmaceutical innovation 

A summary of early studies of the cost of drug development can be found in the authors' 
previous study (DiMasi et al., 1991) and in OTA (1993). In brief, the early studies were 
either based on a case study of a specific drug (usually ignoring the cost of failed projects) 
or relied on aggregate data. Since the R&D process often extends for a decade or more 
and the new drug development process often changes, it is difficult to estimate the cost 
of development from aggregated annual data. In contrast, the study by Hansen (1979) 
and the current authors' previous study (DiMasi et al., 1991) estimated development cost 
based on data supplied by firms for a representative sample of drug development 
efforts. 

DiMasi et al. (1991) used data on self-originated new drugs to estimate the average cost 
of developing a new drug. They obtained data from 1 2 pharmaceutical firms on the research 
and development costs of 93 randomly selected new drugs that entered clinical trials be- 
tween 1970 and 1982. From these data they estimated the average pre-tax out-of-pocket 
cost per approved drug to be US$ 114 million (1987 dollars). Since these expenditures 
were spread out over nearly a dozen years, they capitalized these expenditures to the date 
of marketing approval using a 9% discount rate. This yielded an estimate of US$ 23 1 
million (1987 dollars). Measured in constant dollars, this value is more than double that 
obtained by Hansen for an earlier sample. DiMasi et al. (1991) also found that the average 
cost of the first two phases of clinical trials doubled between the first and second half of 
their sample. This led to the expectation that development costs would be higher in future 
samples. 

Based on an analysis by Myers and Shyam-Sunder performed for the OTA, the OTA 
(1993) report noted that the cost-of-capital for the industry was roughly 10% in the early 
1980s. This is moderately higher than the 9% used by DiMasi et al. (1991). The OTA also 
recalculated the DiMasi et al. (1991) numbers using an interest rate that varied over the life 
of the R&D cycle thereby raising the cost estimate by US$ 100 million in 1990 dollars. 1 
The OTA presented both pre- and post-tax cost estimates. 



1 The OTA applied a range of discount rates that varied with the time to marketing approval. They chose 14% 
for the earliest stage R&D and 10% for development just prior to approval, with rates in between that declined 
linearly with time in development. This approach was meant to capture the essence of the risk-return staircase 
perspective expressed by Myers and others, and discussed below. The methodology described in Myers and Howe 
(1997) is actually quite different, but the OTA technique yielded results that would not be much different (for the 
same distribution of costs) than what one would have obtained with the correct methodology (Myers and Howe, 
1997, p. 33). 
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| » NCEs -»- Real R&D (2000$) — NCE Trend] 



Fig. I , Inflation-adjusted industry R&D expenditures (2000 dollars) and US new chemical entity (NCE) approvals 
from 1963 to 2000. Source of data: PhRMA (2001) and Tufts CSDD Approved NCE Database. 



12- Aggregate data analyses 

There have been no recent comprehensive studies of the cost of developing new pharma- 
ceuticals from synthesis to marketing approval based on actual project-level data. However, 
aggregate data and data on parameters of the drug development process suggest that R&D 
costs have increased substantially since our earlier study. For example, the Pharmaceutical 
Research and Manufacturers of America (PhRMA, 2000) publishes an annual report on the 
R&D expenditures of its member firms that shows a continuous increase in outlays well in 
excess of inflation. Reports on specific components of the R&D process, such as the number 
of subjects in clinical trials (OTA, 1993; The Boston Consulting Group [BCG], 1993), also 
suggest an increase in the real cost of pharmaceutical innovation. 

Published aggregate industry data suggest that R&D costs have been increasing. Fig. 1 
shows reported aggregate annual domestic prescription drug R&D expenditures for mem- 
bers of the US pharmaceutical industry since 1963. The chart also shows the number of 
US new drug approvals by year. Given the much faster rate of growth of R&D expendi- 
tures, data such as these suggest that R&D costs have increased over time. However, they 
cannot be conclusive or precise. For one matter, the drug development process is known 
to be very lengthy. Thus, new drug approvals today are associated with R&D expenditures 
that were incurred many years prior. Ignoring the inherent lag structure underlying these 
data and simply dividing current R&D expenditures by the number of new drug approvals 
will in general yield inaccurate estimates. 2 Given a substantial increasing trend in R&D 



2 The estimates would also vary widely from year-to-year. For example, if we divided each year's real R&D 
expenditures by that year's number of NCE approvals, we would obtain US$ 1 billion for 2000, US$ 743 million 
for 1999, US$ 839 million for 1998, US$ 568 million for 1997, US$ 400 million for 1996, US$ 635 million for 
1995, and US$ 878 million for 1994. While there is a general upward trend in such calculations, the year-to-year 
variability is not credible. 
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expenditures, such calculations will result in greatly exaggerated estimates of out-of-pocket 
cost per approval. 

Secondly, even properly lagged time series would tend to be imprecise if aggregate in- 
dustry data were used as reported. The industry data include expenditures on improvements 
to existing products. Thus, they would overestimate pre-approval development costs. On 
the other hand, they also do not incorporate all of the R&D on licensed-in drugs since 
firms or other organizations that are not members of the US trade association would have 
conducted some of the work. On that account the data would tend to underestimate costs. 
Therefore, R&D cost estimates based on project-level data are needed to assure a reasonable 
level of confidence in the accuracy of the results. We present results based on such data in 
this study. 

The remainder of this paper is organized as follows. Section 2 describes the standard 
drug development paradigm, which serves as the structure through which the results are 
reported. Section 3 contains a description of the survey sample data and the population from 
which it was drawn. Section 4 describes the methodology used to derive R&D cost esti- 
mates. We present our base case pre-marketing approval R&D cost estimates in Section 5, 
as well as a comparison of our results with those of earlier studies to examine R&D cost 
trends. Section 6 provides sensitivity analyses for key parameters. Section 7 focuses on 
some extensions of the base case analyses: estimates of clinical development costs for ap- 
proved drugs by therapeutic significance, estimates of post-approval R&D costs, and a tax 
analysis. Section 8 contains data and analyses that corroborate our results. Finally, we offer 
some conclusions in Section 9. 



2. The new drug development process 

New drug development can proceed along varied pathways for different compounds, but 
a development paradigm has been articulated that has long served well as a general model. 
The paradigm is explained in some detail elsewhere (DiMasi et al., 1 99 1 ; US Food and Drug 
Administration [FDA], 1999). In outline form, the paradigm portrays new drug discovery 
and development as proceeding in a sequence of (possibly overlapping) phases. Discovery 
programs result in the synthesis of compounds that are tested in assays and animal models. 
It was not possible to disaggregate our data into discovery and preclinical development 
testing costs, 3 so for the purposes of this study discovery and preclinical development costs 
are grouped and referred to as preclinical costs. 

Clinical (human) testing typically proceeds through three successive phases. In phase I, 
a small number of usually healthy volunteers 4 are tested to establish safe dosages and to 
gather information on the absorption, distribution, metabolic effects, excretion, and toxicity 
of the compound. To conduct clinical testing in the United States, a manufacturer must first 



3 The reported basic research expenditures by firm were highly variable, and suggest that different firms may 
categorize their pre-human testing expenditures somewhat differently. Thus, we report pre-human testing costs in 
one figure. 

4 In some therapeutic areas, testing is initially done on patients who have the disease or condition for which the 
compound is intended to be a treatment. This is ordinarily true in the cancer and AIDS areas. 
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file an investigational new drug application (IND) with the FDA. However, initiation of 
human testing can, and often does, occur first outside the United States. 

Phase II trials are conducted with subjects who have the targeted disease or condition 
and are designed to obtain evidence on safety and preliminary data on efficacy. The number 
of subjects tested in this phase is larger than in phase I and may number in the hundreds. 
The final pre-approval clinical testing phase, phase III, typically consists of a number of 
large-scale (often multi-center) trials that are designed to firmly establish efficacy and to 
uncover side-effects that occur infrequently. The number of subjects in phase III trials for 
a compound can total in the thousands. 

Once drug developers believe that they have enough evidence of safety and efficacy, 
they will compile the results of their testing in an application to regulatory authorities 
for marketing approval. In the United States, manufacturers submit a new drug appli- 
cation (NDA) or a biological license application (BLA) to the FDA for review and 
approval. 



3. Data 

Ten multinational pharmaceutical firms, including both foreign and US-owned firms, 
provided data through a confidential survey of their new drug R&D costs. 5 Data were 
collected on clinical phase costs for a randomly selected sample of the investigational drugs 
of the firms participating in the survey. 6 The sample was taken from a Tufts Center for the 
Study of Drug Development (CSDD) database of investigational compounds. Cost and time 
data were also collected for expenditures on the kind of animal testing that often occurs 
concurrently with clinical trials. 7 The compounds chosen were all self-originated; that is, 
their development up to initial regulatory marketing approval was conducted under the 
auspices of the surveyed firm. 8 Licensed-in compounds were excluded because non-survey 
firms would have conducted portions of the R&D. 9 

We also collected data from the cost survey participants on their aggregate annual phar- 
maceutical R&D expenditures for the period 1980-1999. The firms reported on total an- 
nual R&D expenditures broken down by expenditures on self-originated new drugs, on 
licensed-in or otherwise acquired new drugs, and on already-approved drugs. Annual ex- 
penditures on self-originated new drugs were further decomposed into expenditures during 
the pre-human and clinical periods. 

The National Institutes of Health (NIH) support through their own labs and through grants 
to researchers in academic and other non-profit institutions a substantial amount of research 



5 Using pharmaceutical sales to measure firm size, four of the survey firms are top 1 0 companies, another four 
are among the next 10 largest firms, and the remaining two are outside the top 20 (PJB, 2000). 

6 A copy of the survey instrument is available upon request. 

7 Long-term teratogenicity and carcinogenicity testing may be conducted after the initiation of clinical trials. 

8 This does not preclude situations in which the firm sponsors trials that are conducted by or in collaboration 
with a government agency, an individual or group in academia, a non-profit institute, or another firm. 

9 Large pharmaceutical firms much more often license-in than license-out new drug candidates. Firms that 
license-in compounds for further development pay a price for that right through up-front fees, milestone payments, 
and royalty arrangements. 
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that expands fundamental knowledge about human biology (NIH, 2000; Scherer, 2000). 
This basic research sometimes results in leads that industrial researchers can capitalize 
on to assist them in discovering new therapeutic compounds. 10 Some new compounds 
investigated by pharmaceutical firms, however, originated in government or academic labs. 
It is unclear whether the discovery and early development costs for such compounds are 
similar to those for compounds originating in industrial labs. These drugs, though, represent 
a very small portion of the total number developed. For example, NIH (2000) found that 
of 47 FDA-approved drugs that had reached at least US$ 500 million in US sales in 1999, 
the government had direct or indirect use or ownership patent rights to only four of them. 1 1 
In addition, we used a Tufts CSDD database supplemented by commercial databases to 
determine that of the 284 new drugs approved in the United States from 1990 to 1999, 12 
93.3% originated from industrial sources (either from the sponsoring firm or from another 
firm from which the compound was licensed or otherwise acquired). Government sources 
accounted for 3.2% of these approvals and academia and other non-profits accounted for 
the other 3.5%. 13 

The survey firms accounted for 42% of pharmaceutical industry R&D expenditures. 14 
The survey compounds were selected at random from data contained in the Tufts CSDD 
database of investigational compounds for the firms that agreed to participate in the R&D 
cost survey. Of the 68 compounds chosen, 61 are small molecule chemical entities, four are 
recombinant proteins, two are monoclonal antibodies, and one is a vaccine. Initial human 
testing anywhere in the world for these compounds occurred during the period 1983-1994. 
Development costs were obtained through 2000. 15 



10 The NIH also supports the development of research tools that drug developers find useful. In addition, it funds 
training for many scientists, some of whom eventually are employed in the industrial sector. 

1 1 The four drugs were developed in part through the use of NIH-funded patented technologies. Three of the 
four products are recombinant proteins, with two being the same drug produced by two different companies. Each 
of the relevant patented technologies was developed at academic or non-profit institutions with financial support 
from the NIH. 

12 The definition of a new drug used for this analysis is a therapeutic new molecular entity approved by the FDA's 
Center for Drug Evaluation and Research. 

13 The proportion of investigational drugs that derive from industrial sources is likely to be even higher, since 
acquired drugs have higher clinical approval success rates than do self-originated drugs (DiMasi, 2001b). Our 
cost survey firms were less reliant on licensing-in drugs from non-industrial sources than were firms as a whole; 
98.8% of their new drug approvals during 1 990-1999 were from industrial sources. DiMasi (2000) found markedly 
greater market entry of small niche pharmaceutical firms in the 1990s relative to earlier periods as measured by 
sponsorship of new chemical entity (NCE) approvals. A disproportionate share of the approvals obtained by these 
new entrants was for drugs that originated in academia. 

14 The data used were aggregate firm pharmaceutical R&D expenditures for the cost survey firms, as reported 
on our questionnaire, in comparison to PhRM A member firm R&D expenditures (1 994-1 997) on ethical pharma- 
ceuticals, adjusted to global expenditure levels (PhRM A, 2001). 

15 Surveys were sent to 24 firms (some of whom have since merged). Twelve firms responded that they would 
participate in some form. The data that two firms ultimately provided were not useable. The 10 firms from which 
we used data provided information on 76 compounds. However, the data for eight of these compounds were not 
sufficiently comprehensive to use. The firms that did not participate in the survey cited a number of reasons for 
not doing so. The reasons included the extra demands that the transition effects of a relatively recent merger were 
placing on their relevant personnel, the time and expense of retrieving archival records in the manner required by 
the study, and difficulties in gathering the relevant data in a uniform manner because their accounting systems had 
changed significantly over the study period. 
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We selected a stratified random sample of investigational compounds. Stratification was 
based on the time elapsed since the origination of clinical trials and the current status of 
that testing. Reported costs were weighted to reflect the characteristics of the population, 
so that knowledge of the population from which the sample was drawn was needed. The 
population is composed of all investigational compounds in the Tufts CSDD investigational 
drug database that met study criteria: the compounds were self-originated and first tested in 
humans anywhere in the world from 1983 to 1994, and we had the information necessary 
to classify them according our strata. We found 538 investigational drugs that met these 
criteria. Of these compounds, 82 (15.2%) have been approved for marketing, 9 (1.7%) had 
NDAs or BLAs that were submitted and are still active, 5 (0.9%) had NDAs or BLAs 
submitted but abandoned, 227 (42.2%) were terminated in 4 years or less from the initiation 
of clinical trials, 172 (32.0%) were terminated more than 4 years after the start of clinical 
testing, and 43 (8.0%) were still in active testing as of the most recent check (31 March 
2001). 

Some firms were not able to provide full phase cost data for every new drug sampled. 
For example, phase I cost data were available for 66 of the 68 new drugs. However, we 
had some phase cost data for every drug in the sample. In addition, five compounds were 
still active at the time of the study. For these drugs it is possible that there will be some 
future costs for the drug's most recent phase. Thus, for this reason our cost estimates may 
be somewhat conservative. However, given the small number of drugs in this category and 
the fact that the impact would be on only one phase for each of these drugs, our overall cost 
estimates are not likely to be materially affected. 

4. Methodology for estimating new drug development costs 

The approach that we use to estimate development costs is similar to that described in 
our earlier work (DiMasi et al., 1991). We will outline here the general methodology for 
developing an overall cost estimate. In describing the approach, it will be clear that cost 
estimates for important components of the drug development process will also be derived 
along the way. 

The survey sample was stratified to reduce sampling error. Results from previous anal- 
yses suggested that the variability of drug costs tends to increase with the development 
phase or the amount of time that a drug spends in testing (Hansen, 1979; DiMasi et al., 
1991). Costs for successful drugs (i.e. those that achieve regulatory approval) also tend to 
be higher and more variable than those for drug failures. Thus, we based our strata on the 
length of time that failed compounds were in clinical testing and whether or not a compound 
had reached the stage in which an application for marketing approval had been filed with 
the FDA. 16 



16 Specifically, we used four strata: compounds that failed in 4 years or less of clinical testing; compounds that 
failed after more than 4 years had elapsed from initial human testing; compounds for which an NDA or a BLA had 
been submitted to the FDA; and compounds that were still in active testing (as of 30 March 2001). Compounds 
for which an application for marketing approval had been submitted or which had been abandoned after lengthy 
testing were deliberately oversampled. The reported sample values were then weighted, where the weights were 
determined so that the sample perfectly reflects the population in terms of the four strata. 
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4.1. Expected costs in the clinical period 

Since new drug development is a risky process, with many compounds failing for every 
one that succeeds, it is necessary to analyze costs in expected value terms. The total clinical 
period cost for an individual drug can be viewed as the realization of a random variable, c. 
Given that it is not certain that development of a randomly selected investigational compound 
will proceed to a given phase, we may define expected clinical costs for a randomly selected 
investigational drug to be C = E(c) = p\H\\ e + PuM\\e + PmHn\\ e + PAV<A\e, where /?i,/?h, 
and phi, are the probabilities that a randomly selected investigational compound will enter 
phases I— III, respectively,/^ the probability that long-term animal testing will be conducted 
during the clinical trial period, and the fi's are conditional expectations. Specifically, ii\\ ey 
AMi|e> Mni|e, and fiA\e are the population mean costs for drugs that enter phases I — III, and 
clinical period long-term animal testing, respectively. 

Weighted mean phase costs derived from the cost survey data were used to estimate the 
conditional expectations. A description of how the probabilities were estimated is presented 
in the next section. Assuming that the estimated mean phase costs and success probabilities 
are stochastically independent, the estimated expected value is an unbiased estimate of the 
population expected value. 

4.2. Clinical success and phase attrition rates 

An overall clinical approval success rate is the probability that a compound that enters the 
clinical testing pipeline will eventually be approved for marketing. Attrition rates describe 
the rate at which investigational drugs fall out of testing in the various clinical phases. A 
phase success rate is the probability that a drug will attain marketing approval if it enters the 
given phase. A phase transition probability is the likelihood that an investigational drug will 
proceed in testing from one phase to the next. All of these probabilities can be estimated 
from data in the Tufts CSDD database of investigational drugs from which our survey 
sample was drawn. 

The clinical approval success rate was estimated using a two-stage statistical estimation 
process that has been described in detail elsewhere (DiMasi et al . , 1 99 1 ; DiMasi, 200 1 b). The 
data used here consist of the investigational drugs in the Tufts CSDD database that were first 
tested in humans anywhere in the world from 1983 to 1994, with information on their status 
(approval or research abandonment) obtained through early 2001. Given that some of these 
investigational drugs were still in active testing at the end of the study period, some of the 
data are right-censored. Survival analysis can be applied in such a situation, where survival 
indicates that a drug has not reached its ultimate fate (either approval or abandonment). 

The Tufts CSDD database of investigational compounds contains information on the 
latest phase that an abandoned compound was in when it was terminated. These data were 
used to determine the distribution of research terminations by phases. 17 These results, 



17 A small proportion of the compounds in the database were either still in clinical development (8.0%) or had 
an NDA or BLA filed but not yet approved (1 .7%). For those drugs in these groups that will eventually fail, their 
abandonment will tend to occur in later testing phases. To deal with the potential bias in the estimated distribution 
of research terminations that would result from using just those compounds that had been abandoned by the end of 
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together with the estimated overall clinical approval success rate were used to provide 
estimates of the probability that an investigational drug will enter a given phase, phase 
attrition rates, and phase transition probabilities. The estimated overall clinical approval 
success rate and the probabilities of entering various phases provide results with which 
estimates can be derived that include the cost of drugs that fail to make it through the 
development process. Specifically, we use the probabilities of entering a phase to estimate 
the expected out-of-pocket clinical cost per investigational drug. Adding the out-of-pocket 
preclinical cost estimate described below yields an estimate of total out-of-pocket cost per 
investigational drug. Dividing this estimate by the overall clinical success rate yields our 
estimate of out-of-pocket cost per approved drug. 

4.3. Out-of-pocket discovery and preclinical development costs 

Many costs incurred prior to clinical testing cannot be attributed to specific compounds. 
Thus, aggregate level data at the firm level were used to impute costs per drug for R&D 
incurred prior to human testing. Specifically, time series data for each surveyed firm on 
spending on pre-human R&D and on human testing for 1980-1999 were obtained, and a 
ratio of pre-human R&D expenditures to human testing expenditures was determined based 
on an appropriate lag structure (on average, pre-human R&D expenditures should occur 
years prior to the associated human testing costs). This ratio was then multiplied by an 
estimate of out-of-pocket clinical cost per drug, which is based on the project-level data, to 
yield an estimate of the pre-human R&D cost per new drug. 18 

4.4. Capitalized costs: development times and the cost-of-capital 

Given that drug development is a very lengthy process, the full cost of drug development 
should depend significantly on the timing of investment and returns. Full cost estimates 
require a capitalization of the stream of out-of-pocket costs to some point (the date of 
marketing approval is the standard). To do so, one needs a timeline for a representative drug. 
The timeline is constructed from information on average phase lengths and the average gaps 
and overlaps between successive phases in a Tufts CSDD database of approved new drugs 
and in our cost survey. The periods considered are the time from synthesis to human testing, 

the study period, we statistically predicted whether each open compound (still in clinical testing) would eventually 
fail. To do so, we evaluated an estimated conditional approval probability function (probit specification) at the 
number of years that the compound had been in testing. Failures were taken to occur in the latest reported testing 
phase. Summing the failure probabilities by phase gives us additional terminations by phase. The distribution of 
research terminations by phase was adjusted accordingly. Compounds that had reached the NDA/BLA phase likely 
have a very high probability of success. DiMasi (2001a) found very high approval rates for NDA submissions, 
with an increasing trend. To be conservative, we assumed that all of the compounds with still active NDAs or 
BLAs would be approved. This leads to lower cost estimates than would be the case if the same procedure for 
determining failure that was used for compounds still in testing had been used instead. However, given the very 
small number of compounds in the active NDA/BLA category, the impact on the results is trivial. 
18 The survey firms were asked to indicate whether charges for corporate overhead unrelated to R&D appear in 
their R&D budget data, and, if so, to estimate what share of expenditures they represent. Two firms reported that 
they did, and so we reduced the aggregate and project-level data for those firms according to their reported shares 
for corporate overhead. 
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the three clinical phases, an animal testing phase concurrent with clinical development, and 
the length of time from submission of an NDA/BLA to NDA/BLA approval. 

Whereas the survey data cover a development period that yielded approvals from 1990 
to 2001, the bulk of the approvals occurred in the mid to late 1990s. Thus, we estimated 
phase lengths, gaps, and overlaps for self-originated new drugs that were approved during 
1992-1999. The data included therapeutic biopharmaceuticals, as well as small molecule 
drugs. 19 Once a timeline is established and out-of-pocket costs are allocated over that time- 
line, the expenditures must be capitalized at an appropriate discount rate. The discount rate 
should be the expected return that investors forego during development when they invest 
in pharmaceutical R&D instead of an equally risky portfolio of financial securities. Em- 
pirically, such a discount rate can be determined by examining stock market returns and 
debt-equity ratios for a representative sample of pharmaceutical firms over a relevant pe- 
riod. The resulting discount rate is an average company cost-of-capital. We describe the 
estimation of our base case cost-of-capital in Section 5.2 below. 

We assume that phase costs are distributed uniformly over the phase length and apply 
continuous compounding to the point of marketing approval. Summing these capitalized 
preclinical and clinical capitalized cost estimates yields a total capitalized cost per inves- 
tigational drug. Dividing by the overall clinical success rate results in our estimate of the 
total capitalized cost per approved new drug. This estimate is a measure of the full resource 
cost needed, on average, for industry to discover and develop a new drug to the point of 
marketing approval. 

5. Base case R&D cost estimates 

5.1. Out-of-pocket clinical cost per investigational drug 

Given the method of weighting reported costs as described in Section 4, weighted means, 
medians, and standard deviations were calculated and are presented in Table 1 . 20 Mean 



19 The percentage of all self-originated new compound approvals that are for biopharmaceuticals is substantially 
larger than is the proportion of either self-originated approvals or investigational compounds that are for biophar- 
maceuticals in the Tufts CSDD investigational drug database. The survey firms in this database are predominantly 
traditional pharmaceutical firms. Thus, we estimate clinical phase lengths and approval phase times for new chem- 
ical entities and biopharmaceuticals separately and compute a weighted average of the mean phase lengths, where 
the weights are the shares of self-originated investigational compounds in the Tufts CSDD database for each of 
these compound types. 

20 For five of the sample drugs, the survey firms were not able to disaggregate costs for two successive clinical 
phases (i.e. either phases 1 and II or phases II and III). We developed a two-stage iterative process for imputing 
phase costs for these drugs. To illustrate, suppose that the firm combined phases II and III costs for a specific drug. 
For a year during which the drug was in both phase II and III testing, let m\\ — number of months the drug was in 
phase II only, m\\\ = number of months the drug was in phase III only, wo = number of months the drug was in 
both phases, T = total clinical phase cost for the drug during the year, and cr = ratio of weighted monthly phase 
III to phase II cost for drugs where phase costs were disaggregated. Imputed phase II cost, x\\> can then be defined 
as*ii = (mii +cr-mo)7y(mn H-cr-mm -f [I +cr]-mo). Imputed phase III cost is determined as x\\\ = cr-jcn.The 
same approach was used when phase I and II costs were combined by the responding firm. To further refine the 
results, we included the imputed costs for the five drugs from the first stage and recomputed the phase cost ratios 
to determine second stage values for the imputed costs. The results for imputed costs barely changed between the 
first and the second iterations. 
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Table 1 

Average out-of-pocket clinical period costs for investigational compounds (in millions of 2000 dollars) 3 



Testing phase 


Mean cost 


Median cost 


Standard 
deviation 


N* 


Probability of 
entering phase (%) 


Expected cost 


Phase I 


15.2 


13.9 


12.8 


66 


100.0 


15.2 


Phase II 


23.5 


17.0 


22.1 


53 


71.0 


16.7 


Phase III 


86.3 


62.0 


60.6 


33 


31.4 


27.1 


Long-term animal 


5.2 


3.1 


4.8 


20 


31.4 


1.6 


Total 












60.6 



8 All costs were deflated using the GDP Implicit Price Deflator. Weighted values were used in calculating 
means, medians, and standard deviations. 

b N: number of compounds with full cost data for the phase. 



cost per investigational drug entering a phase increases substantially by clinical phase, 
particularly for phase III, which is typically characterized by large-scale trials. In comparison 
to the previous study (DiMasi et ah, 1991), mean phase I cost is moderately higher relative 
to the other phases. While the ratio of mean phase III cost to mean phase I cost was 6.0 
for the previous study, it was 5.7 here. Similarly, the ratio of mean phase II to phase I cost 
was 1.9 for the earlier study, but was 1.5 for this study. The higher relative phase I cost 
is consistent with other data that indicate that the growth in the number of procedures per 
patient was much greater for phase I than for the other phases during the 1990s. 21 

Mean clinical phase costs increased approximately five-fold in real terms between the 
studies. However, in comparison, long-term animal testing costs incurred during the clinical 
period increased by only 60%. Thus, increases in out-of-pocket clinical period costs were 
driven heavily by increases in human trial, as opposed to animal testing, costs. This suggests 
that preclinical animal studies may also have not increased at anywhere near the same rate 
as have clinical trial costs. The results also indicate that development costs have become 
more uniform across drugs. 

This is indicated by two comparisons with the results from the previous study. The ratio 
of mean to median phase cost decreased 50% for phase I, 22% for phase II, and 13% for 
phase III for the present study in comparison to the earlier study. Thus, the data are less 
skewed. The coefficients of variation for the phases also declined. They are 60% lower for 
phase I, 29% lower for phase II, and 36% lower for phase III. 

Estimates of the probability that an investigational drug will enter a phase were obtained 
from statistical analysis of information in the Tufts CSDD database of investigational com- 
pounds for drugs that met study criteria. They are shown in Table 1 and are used to obtain 
the expected phase costs in the last column. The probabilities are lower in comparison to 
the previous study (75.0% for phase II, 36.3% for phase III, and 56.1% for long-term ani- 
mal testing). Lower probabilities of entering a phase will, other things being equal, result 
in lower expected costs. Thus, while the mean phase costs for drugs entering a phase are 



21 One of the authors obtained data from DataEdge, LLC on the number of procedures administered to patients 
by phase from 1990 to 1997. The data were based on information in the clinical trial grants of a very large number 
of pharmaceutical firms. During this period, the number of procedures per patient increased 27% for phase III, 
90% for phase II, and 120% for phase I. 
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approximately five times higher in this study, the expected cost per investigational drug is 
only four times higher. 

Alternative probability estimates for the same data make clear how reductions in drug 
development risks hold down development costs. Our earlier study showed proportionately 
fewer failures in phase I (32.5% versus 37.0%) and proportionately more failures in phase 
III (17.1% versus 12.6%); the share for phase II was identical. Thus, given a similar overall 
clinical success rate, the evidence suggests that over time firms became better able to 
weed out failures (clinical or economic) early in the process. A similar scenario holds 
when we examine phase transition probabilities. In the earlier study, a larger percentage of 
investigational drugs made it to phase II (75.0% versus 71.0%) and a smaller percentage 
proceeded from phase III to marketing approval (63.5% versus 68.5%). 

5,2. Cost-of-capital estimates 

In our earlier paper (DiMasi et al., 1991), we employed a 9% real cost-of-capital based 
on a capital asset pricing model (CAPM) analysis for a representative group of pharmaceu- 
tical firms during the 1970s and early 1980s. A real rather than a nominal cost-of-capital 
is appropriate in our analysis since R&D costs are expressed in constant 2000 dollars. 
The real cost-of-capital in pharmaceuticals has increased since the mid-1980s primarily 
as a result of higher real rates of return required by holders of equity capital during the 
1990s. 

In the present analysis, we compute a weighted cost-of-capital for each firm in a represen- 
tative group of pharmaceutical firms for the 1980s and 1990s, where the weights are based 
on the firm's market value of debt and equity. For most major pharmaceutical firms, debt 
securities account for less than 10% of market valuation, so that the equity cost-of-capital 
component is the dominant element of the weighted cost-of-capital for this industry. At the 
request of the OTA, Myers and Shyam-Sunder (1996) estimated the cost-of-capital for the 
pharmaceutical industry during the 1970s and 1980s using a standard CAPM approach. 
Their methodology is the basis for our updated analysis. 

In our R&D cost analysis we have a sample of new drugs that began clinical trials in the 
mid-1980s through the early 1990s, and which have an average market introduction point in 
the late 1990s. Hence a relevant time period for our cost-of-capital measure is 1985-2000. 
Accordingly, we estimated the cost-of-capital at roughly 5-year intervals beginning in Jan- 
uary 1985 and ending in January 2000. The results of our analysis are summarized in 
Table 2. 

The nominal cost-of-capital in 1985 and 1990 are based on Myers and Shyam-Sunder's 
analysis for the OTA. The 1994 value is from Myers and Howe (1997). The 2000 nominal 
cost-of-capital (COC) value is based on our own estimation, employing a sample of firm 
and data sources comparable to those used in the prior work of Myers and colleagues. As 
can be seen in Table 2, the nominal cost-of-capital for pharmaceutical firms has remained 
relatively stable in this period in the range of 14-16%, with a mean of approximately 1 5%. 22 

22 We undertook an informal survey of major pharmaceutical firms in mid-2001 with respect to the hurdle rate 
that they used in their R&D investment decisions. This survey of six firms yielded (nominal) hurdle rates from 
13.5 to over 20%. This indicates that a 15% nominal COC rate is within the range of hurdle rates utilized by major 
pharmaceutical firms for their actual R&D investments. 
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Table 2 

Nominal and real cost-of-capital (COC) for the pharmaceutical industry, 1985-2000 





1985 


1990 


1994 


2000 


Nominal COC (%) a 


16.1 


15.1 


14.2 


15,0 


Inflation rate (%) b 


5.4 


4.5 


3.1 


3.1 


Real COC (%) 


10.8 


10.6 


11.1 


11.9 



a The nominal values for 1985 and 1990 are based on Myers and Shyam-Sunder (1996). The nominal value 
for 1 994 is taken from Myers and Howe (1997). The 2000 nominal value is based on our own computations using 
comparable samples and data sources. 

b The inflation rate for 1985 is taken from Myers and Howe (1997), the rate for 1990 is a 5-year average 
centered on January 1990 and is based on the CPI-U, the rate for 1994 and 2000 is the long-term inflation rate 
from 1926 to 2000 (Ibbotson Associates, 2001, p. 17). 



To obtain a real cost-of-capital, we subtracted the expected rate of inflation from the 
nominal cost-of-capital. For this purpose, Myers and Shyam-Sunder (1996) used the ex- 
pected rate of inflation from a special consumer survey performed in the 1980s. We also 
used this value in Table 2 for the 1 985 period. For 1 990 we utilized a 5-year moving average 
of actual inflation rates centered around the year in question to estimate expected rates of 
inflation. For 1994 and 2000 we used the long-term inflation rate (1926-2000) in Ibbotson 
and Associates (2001) of 3.1% to compute the values in Table 2. 23 

The real cost-of-capital for the pharmaceutical industry over this period, using the CAPM 
model, varies from 10.6 to 12.0%. The mean cost-of-capital in this period was just over 1 1%. 
Hence, 1 1% is the baseline value that we employed in our R&D cost estimates. 24 However, 
as in prior studies, we did sensitivity analysis around this value in order to determine how 
our baseline R&D cost estimates are affected by changes in the cost-of-capital. 

5.3. Capitalized clinical cost per investigational drug 

To calculate opportunity cost for clinical period expenditures we estimated average phase 
lengths and average gaps or overlaps between successive clinical phases. Mean phase lengths 
and mean times between successive phases are shown in Table 3. The time between the start 
of clinical testing and submission of an NDA or BLA with the FDA was estimated to be 
72.1 months, which is 3.5 months longer than the same period estimated in the previous 
study. However, the time from the start of clinical testing to marketing approval in our 
timeline for a representative drug averaged 90.3 months for the current study, compared to 



23 Inflation rates were particularly low in the 1990s, and 5-year moving averages were below the long-term 
rate. Since the 1990s represented a marked change in the inflation rate from earlier decades, and inflationary 
expectations may not adjust immediately to the new experience, we used the long-term inflation rate rather than 
5-year moving averages for this period. 

24 This yields conservative estimates of the cost of capital from several perspectives. One important point concerns 
the fact that many major pharmaceutical firms have large positive cash balances and are actually net lenders rather 
than net borrowers (i.e. they have a negative debt ratio). Incorporating this point into their CAPM analysis for 
January, 1990, causes the estimated nominal value of the cost of capital to increase by almost a full percentage 
point (see Myers and Shyam-Sunder, 1 996, p. 223). In addition, as noted in footnote 4, many firms appear to use 
higher costs of capital in their R&D investment decisions than what emerges from this CAPM analysis. 
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Table 3 



Average phase times and clinical period capitalized costs for investigational compounds (in millions of 2000 
dollars) 3 



Testing phase 


Mean phase 
length 


Mean time to 
next phase 


Capitalized mean 
phase cost b c 


Capitalized expected 
phase cost bc 


Phase I ' 


21.6 


12.3 


30.5 


30.5 


Phase II 


25.7 


26.0 


41.6 


29.5 


Phase III 


30.5 


33.8 


119.2 


37.4 


Long-term animal 


36.5 




9.5 


3.0 


Total 








100.4 



a All costs were deflated using the GDP Implicit Price Deflator. Weighted values were used in calculating 
means, medians, and standard deviations for costs and phase times. Phase times are given in months. 

b The NDA approval phase was estimated to be 1 8.2 months. Animal testing was estimated to begin 4.2 months 



after the initiation of phase I. 

c Costs were capitalized at an 1 1% real discount rate. 

98.9 months for the earlier study. The difference is accounted for by the much shorter FDA 
approval times in the mid to late 1990s that were associated with the implementation of the 
Prescription Drug Use Fee Act of 1992. While the approval phase averaged 30.3 months 
for the earlier paper's study period, that phase averaged only 1 8.2 months for drugs covered 
by the current study. 

Other things being equal, the observed shorter times from clinical testing to approval 
yield lower capitalized costs relative to out-of-pocket costs. However, the discount rate that 
we used for the current study is also higher than for the previous study (11% versus 9%). 
The two effects work in offsetting ways, On net, there was very little difference between 
the studies in the ratio of mean capitalized to out-of-pocket cost for the individual clinical 
phases. 25 

5.4. Clinical cost per approved new drug 

Although average cost estimates for investigational drugs are interesting in their own 
right, we are mainly interested in developing estimates of cost per approved new drug. 
To do so, we need an overall clinical approval success rate. Our statistical analysis of 
compounds in the Tufts CSDD database of investigational drugs that met study criteria 
yielded a predicted final clinical success rate of 21.5%. Applying this success rate to our 
estimates of out-of-pocket and capitalized costs per investigational drug results in estimates 
of cost per approved new drug that link the cost of drug failures to the successes. 

Aggregating across phases, we find that the out-of-pocket clinical period cost per ap- 
proved new drug is US$ 282 million and the capitalized clinical period cost per approved 
new drug is US$ 467 million. These costs are more than four-fold higher than those we 
found in our previous study. 



25 The ratios of capitalized to out-of-pocket cost for the earlier study were 1 .9, 1 .7, 1.4, and 1 .6 for phases I— III, 
and animal testing, respectively. For this study, we found the ratios to be 2.0, 1 .8, 1 .3, and 1 .8 for phases I — III, and 
animal testing, respectively. 
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5.5. Preclinical out-of-pocket and capitalized costs per approved drug 

The preclinical period, as defined here, includes discovery research as well as preclinical 
development. As noted above, not all costs during this period can be allocated to specific 
compounds. To deal with this issue, we analyzed aggregate annual firm expenditures on 
self-originated new drugs by the preclinical and clinical periods. We gathered data on 
aggregate expenditures for these periods from survey firms for 1980-1999. Both times 
series tended to increase over time in real terms. Given this outcome, and the fact that 
the clinical expenditures in 1 year will be associated with preclinical expenditures that 
occurred years earlier, the ratio of total preclinical expenditures to total R&D (preclinical 
plus clinical) expenditures over the study period will yield an overestimate of the share of 
total cost per new drug that is accounted for by the preclinical period. To accurately estimate 
this share we built in a lag structure that associates preclinical expenditures with clinical 
expenditures incurred some time later. Using data in the Tufts CSDD database of approved 
drugs, we estimated the average time from synthesis of a compound to initial human testing 
for self-originated drugs to be 52.0 months. Our analysis of clinical phase lengths and phase 
gaps and overlaps indicates a period of 68.8 months over which clinical period development 
costs are incurred. We approximate the lag between preclinical and clinical expenditures 
for a representative new drug as the time between the midpoints of each period. This yields 
a lag of 60.4 months, or approximately 5 years. Thus, we used a 5-year lag in analyzing the 
aggregate expenditure data. Doing so resulted in a preclinical to total R&D expenditure ratio 
of 30%. This share was applied to our clinical cost estimates to determine corresponding 
preclinical cost estimates. Given the estimates of out-of-pocket and capitalized clinical cost 
per approved new drug noted in Section 5.4, we can infer preclinical out-of-pocket and 
capitalized costs per approved new drug of US$ 121 and 335 million, respectively. The 
results are very robust to different values for the length of the lag structure. For example, if 
we assume a lag of 4 years instead of 5 years, then out-of-pocket preclinical costs would be 
9.8% higher. Alternatively, if we assume a 6-year lag, then out-of-pocket preclinical costs 
would be 9.3% lower. 

5.6. Total capitalized cost per approved drug 

Our full cost estimate is the sum of our preclinical and clinical period cost estimates. Our 
base case out-of-pocket cost per approved new drug is US$ 403 million, while our fully 
capitalized total cost estimate is US$ 802 million. Time costs, thus, account for 50% of total 
cost. This share is nearly identical to one that we found in our previous study (51%). This 
is the case even though the time cost shares for both the clinical and preclinical periods are 
somewhat higher for this study. The explanation for this seeming inconsistency is that time 
costs are relatively greater for preclinical expenditures since they are incurred earlier in the 
process, but the preclinical share of total costs is lower for the present study. 

5. 7. Trends in R&D costs 

Fig. 2 presents the primary results (capitalized preclinical, clinical, and total cost per ap- 
proved new drug) for the previous two studies and for our current study. In inflation-adjusted 
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Fig. 2. Trends in capitalized preclinical, clinical and total cost per approved new drug. 



terms, total capitalized cost was 2.3 times higher for the previous study in comparison to 
the first study. Real total capitalized cost per approved new drug for the current study is 2.5 
times higher than for the previous study. However, the samples for these studies include 
drugs that entered clinical testing over periods that are not uniformly dispersed. In addition, 
while the samples were chosen on the basis of when drugs entered clinical testing, changes 
over time in the average length of the development process make ascribing differences in 
the study periods according to the year of first human testing problematic. An alternative is 
to determine an average approval date for drugs in each study's sample and use the differ- 
ences in these dates to define the time differences between the studies. This will allow us 
to determine annual cost growth rates between successive studies. 

Drugs in the current study sample obtained FDA marketing approval from 1990 to 2001, 
with the vast majority of the approvals occurring between 1992 and 2000. The mean and 
median approval date for drugs in the current study's sample was in early 1997. For the 
previous study, we reported that the average approval date was in early 1984. Thus, we used 
13 years as the relevant time span between the studies and calculated compound annual 
rates of growth between the two studies accordingly. 

Hansen (1979) did not report an average approval date; however, we can infer a period 
difference by noting the sample selection criteria and the difference in development times 
between that study and the DiMasi et al. (1991) study. The sample selection criteria for 
DiMasi et al. (1991) involved a 7-year shift in initial clinical testing relative to Hansen 
(1979). However, the estimated time from the start of clinical testing to marketing approval 
was 2.3 years longer for the DiMasi et al. (1991) study. Thus, we use 9.3 years as the 
difference between the study periods for these two studies. 

Using these period differences, we found that the compound annual growth rates in 
total out-of-pocket cost per approved drug were quite similar across the studies (Table 4). 
The growth in total costs, however, masks substantial differences in growth rates for the 
preclinical and clinical periods. While out-of-pocket preclinical expenditures continued to 
grow in real terms, its growth rate for the current study relative to the previous one declined 
by two-thirds in comparison to the growth rate for the first two studies. Conversely, the 
growth rate for clinical period expenditures approximately doubled for the two most recent 
studies. 
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Table 4 



Compound annual growth rates in out-of-pocket and capitalized inflation-adjusted costs per approved new drug 3 



Period 


Out-of-pocket 






Capitalized 








Preclinical (%) 


Clinical (%) 


Total (%) 


Preclinical (%) 


Clinical (%) 


Total (%) 


1970-1980 


7.8 


6.1 


7.0 


10.6 


7.3 


9.4 


1980-1990 


2.3 


11.8 


7.6 


3.5 


12.2 


7.4 



a Costs for the 1970s approvals are from Hansen (1979), costs for the 1980s approvals are from DiMasi et al. 
(1991), and costs for the 1990s approvals are from the current study. 



Annual growth rates for capitalized costs are also shown in Table 4. The results show a 
substantially higher growth rate for clinical costs for the two most recent analyses. However, 
while the growth rate for total out-of-pocket cost per approved drug was slightly greater 
for the two most recent studies, the growth rate in total capitalized cost was two percentage 
points higher between the first and second study than between the second and third. This is 
so, despite the fact that the discount rate increased one percentage point between the first 
two studies, but two percentage points between the last two. The growth rate in capitalized 
costs, however, is driven more by the fact that preclinical costs have a lower share of total 
out-of-pocket costs in the current study than in the previous studies, and time costs are 
necessarily proportionately more important for preclinical than for clinical expenditures. 



6. Sensitivity analysis 

6.1, Effects of parameter changes 

We undertook sensitivity analyses for several of the key parameters that underlie the cost 
estimates. Fig. 3 shows how preclinical, clinical, and total capitalized costs would vary by 
discount rate at half-percentage point intervals. The values for a zero percent discount rate 
are out-of-pocket costs. In the neighborhood of our base case discount rate (1 1%), clinical 
cost changes by about US$ 10 million, preclinical cost changes by about US$ 15 million, 
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Fig. 3. Capitalized preclinical, clinical, and total costs per approved new drug by discount rate. 
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and total cost changes by about US$ 25 million for every half of one percent shift in the 
discount rate. In our previous study, the base case discount rate was 9%. At a 9% discount 
rate, total capitalized cost here is US$ 707 million, or 1 1 .8% less than our base case result. 
The results in section 5.3 provide some support for an even higher discount rate than our 
base case value. At a 12% discount rate, total capitalized cost per approved new drug is 
US$ 855 million, or 6.6% higher than our base case result. 

The clinical approval success rate is another key parameter. We analyzed the effects of an 
approximate 10% change in the success rate at various discount rates. A higher success rate 
has a somewhat smaller impact on total cost than does a correspondingly lower success rate. 
At our base case discount rate, total capitalized cost for a success rate of 23.5% is US$ 734 
million, or 8.5% lower than our base case result. At a success rate of 1 9.5%, total capitalized 
cost is US$ 885 million, or 10.3% higher than our base case result. The estimated clinical 
success rate for our previous study was 23.0%. At that success rate, total capitalized cost 
here is US$ 750 million, or 6.5% less than our base case result. 26 

The methodology for determining the total capitalized cost estimate is dependent on 
values for 20 parameters. However, not all of them are independent of one another. It is 
possible to determine total capitalized cost from estimates of 1 6 parameters. To get a measure 
of statistical error for overall cost, we performed a Monte Carlo simulation (1000 trials) 
for total capitalized cost by taking random draws from the sampling distributions of the 
16 parameters and computing a total cost estimate for each simulation trial. 27 Ninety-five 
percent of the total cost estimates for the simulation fell between US$ 684 and 936 million, 
90% fell between US$ 705 and 917 million, and 80% fell between US$ 717 and 903 
million. 28 The interquartile range was US$ 757-854 million. 



26 These analyses indicate what the results would be if the clinical success rate is changed, while other parameters 
remain the same. If the phase attrition rates are adjusted to be consistent with the new clinical success rate while 
maintaining the same distribution of failures across phases, then the differences in cost are somewhat lower. For 
example, if the clinical success rate is 23.0% and phase attrition rates are altered accordingly, total capitalized cost 
is 5.6% lower (5.1% lower if account is also taken of estimated differences in phase costs between the failures and 
successes in the sample [see the following section]). 

27 The clinical success rate parameter is determined from the values of four asymptotically normal coefficient 
estimates. We performed an initial Monte Carlo simulation for the clinical success rate using these coefficient 
estimates and their standard errors to obtain a sampling distribution for the success rate. The sampling distribution 
for the discount rate was chosen by assumption. Given that the base case choice of discount rate may be somewhat 
conservative (see the discussion above), we chose a triangular distribution for the discount rate that varied from 
10.0 to 12.5%, with the modal value for the distribution chosen so that the mean discount rate is approximately 
1 1 .0% in the simulations for total capitalized cost. The other sampling distributions were for estimated means and 
binomial probabilities. Finite population correction factors were applied to the standard errors. 

28 The simulation was conducted assuming statistical independence for the parameters. The out-of- pocket phase 
cost, development time, and success and attrition rate parameters were estimated from separate datasets, and so 
their independence of one another is likely. It is possible that out-of- pocket phase costs are correlated. We therefore 
also conducted a simulation using the estimated correlations across phases for those pairs that were found to have 
correlations that were statistically significantly different from zero (phases I and II [0.496], phases II and III [0.430], 
phase II and long-term animal testing [0.656]). This increased the variability of the total capitalized cost estimates 
only slightly. Specifically, the coefficient of variation increased from 0.088 to 0.099. The main simulation results 
were affected most by variability in individual phase costs, and least by variability in development times. The coef- 
ficient of variation when only development times vary, when only the discount rate varies, when only success and 
attrition rates vary, and when only out-of-pocket phase costs vary were 0.0 1 5, 0.035, 0.044, and 0.065, respectively. 
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6.2. Variable discount rates 

Myers and others (Myers and Shyam-Sunder, 1996; Myers and Howe, 1997) have ar- 
gued that the cost-of-capital for R&D should decline over the development process as a step 
function. They termed the relationship a risk-return staircase. In the case of pharmaceutical 
R&D, the staircase is not related to the usual notions of risk in pharmaceutical development 
(i.e. the probabilities of approval at different points in the process). These technical risks 
can be diversified away by investors, who can spread their investments over many firms. 
Rather, the rationale has to do with the notion that at any point in the development process 
future R&D costs serve as a kind of leverage, or debt, if the firm wishes to proceed with 
development and market a product. A more levered position amplifies risk and is associated 
with a higher cost-of-capital for investors. Since the leveraging declines over the develop- 
ment process, so does the cost-of-capital. Technical risks play a role only in that they affect 
expected future costs. 

The valuation problem may also be viewed as a compound option pricing problem. 
The firm effectively faces call options at decision points during development, where the 
exercise price is the cost of future R&D. Myers and Howe (1997) suggest a means for 
dealing with the problem that reduces the informational requirements to knowledge of 
two-discount rates. One of these is the discount rate for net revenues on a marketed drug 
(''nr). The other is the discount rate on future costs (rpc)- The rate for net revenues 
should be somewhat less than the overall company COC. The rate for future costs, be- 
ing an expected return on what is nearly a fixed debt obligation, is likely lower. Under 
certain assumptions, the Myers and Howe (1997) two-discount rate method yields the 
same results as the more complex compound options valuation. We view this approach 
to discounting as experimental for our purposes. To our knowledge, no pharmaceutical 
firm uses such an approach for its project evaluations. In addition, although they may be 
guided by real world information, the selections of the two-discount rates are judgment 
calls. 29 

For purposes of comparison, we did compute drug R&D costs with the Myers and 
Howe (1997) two-discount rate method. Their base case values for hsir (9%) and rpc 
(6%) were meant to be relevant for 1994, which corresponds roughly with the middle 
of our study period. Thus, we computed the total capitalized pre-approval cost per ap- 
proved drug using these values and other close combinations in a sensitivity analysis. 
At the Myers and Howe (1997) base case values, total capitalized cost is margi- 
nally higher than our estimate computed at an 11% COC (US$ 815 million). However, 
the total capitalized cost estimate is US$ 955 million when a 10% discount rate is used 
for hs[r and a 5% discount rate is used for rpc- Conversely, at an 8% discount rate for 
rNR and a 7% discount rate for rpc, the total cost estimate is US$ 696 
million. 



29 For their financial life-cycle simulation model, Myers and Howe (1997) chose base case values for /*nr and 
rpc partly on the basis of judgment and partly because these values generated realistic company costs-of-capital 
for mature pharmaceutical firms in their simulations. These simulations required assumptions about revenue 
distributions and other factors that affect profitability. 
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7. Extensions to the base case 

The base case results on overall pre-approval drug development costs can be extended in 
several interesting ways. Our base case results link the costs of the failures to the successes. 
We can provide estimates of the clinical period cost of taking a successful drug all the way 
to approval by examining the data for the approved drugs in the sample. This also allows us 
to obtain some evidence on costs for the more medically significant products (according to 
what is known at the time of approval) by using an FDA prioritization ranking for approved 
drugs. We can also use data collected from our survey to estimate R&D expenditures on 
new drugs subsequent to original marketing approval. Finally, we can examine what impact 
tax policies and procedures have had on the effective cost of pharmaceutical R&D for 
pharmaceutical firms. 

7.1. Development costs for successes 

As our results indicate, development costs vary across drugs. Thus, it is worthwhile to 
examine specific subclasses of drugs, where one may reasonably conjecture that the cost 
structure is different than it is for drugs as a whole. In particular, we investigated the clinical 
cost structure for successful drugs (i.e. drugs that have made it through testing and obtained 
marketing approval from the FDA). We also examined these data classified by an FDA 
rating of therapeutic significance for drug approvals. 

Of the 68 drugs in our sample, 27 have been approved for marketing. We had complete 
phase cost data for 24 of the approvals. Clinical phase cost averages and standard deviations 
for the approved drugs in the sample are shown in Table 5. For comparative purposes, the 
results for the full sample are also shown. Except for phase I, clinical phase costs are 
notably higher for the approved drugs than for drugs as a whole. Phase II and III costs for 
the approved drugs are 77 and 1 8% higher, respectively. This result is qualitatively consistent 
with what we found in our previous study. An explanation that we offered therein may still 
be appropriate. The results may reflect a tendency to prioritize development by directing 
more resources, possibly by conducting more studies concurrently, to investigational drugs 
that appear, after early testing, to be the most likely to be approved. Since we are not linking 

Table 5 



Out-of-pocket clinical period phase costs for approved compounds (in millions of 2000 dollars) 3 



Testing phase 


Approved drugs b 




Full sample 0 






Mean cost 


Median cost 


Standard 
deviation 


Mean cost 


Median cost 


Standard 
deviation 


Phase I 


15.2 


11.7 


14.3 


15.2 


13.9 


12.8 


Phase II 


41.7 


31.5 


30.2 


23.5 


17.0 


22.1 


Phase III 


115.2 


78.7 


95.0 


86.3 


62.0 


60.6 


Long-term animal 


4.4 


0 


5.4 


5.2 


3.1 


4.8 



a All costs were deflated using the GDP Implicit Price Deflator. 

b Estimates for the approved drugs are based on data for 24 of the 68 sample drugs. 

c Weighted values were used in calculating means, medians, and standard deviations for the full sample. 
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failures to successes here and since we have full phase cost data for the 24 approved drugs, 
we can add phase costs for each drug to determine a total clinical period cost for each 
drug and use those data to find confidence intervals for mean out-of-pocket and capitalized 
clinical period cost for approved drugs. Mean out-of-pocket clinical period cost for the 
approved drugs was US$ 176.5 million, with a 95% confidence interval of US$ 126-227 
million. We used actual phase timing for individual approved drugs, rather than averages 
over all approved drugs, to capitalize costs for individual approved drugs. Doing so yielded 
a mean clinical period capitalized cost of US$ 25 1 .3 million, with a 95% confidence interval 
ofUS$ 180.2-322.4 million. 

The FDA prioritizes new drugs by therapeutic significance at the time of submission of an 
application for marketing approval. 30 New drugs are rated as either priority (P) or standard 
(S). 31 Kaitin and Healy (2000), Kaitin and DiMasi (2000), Reichert (2000), and DiMasi 
(2001a) contain numerous analyses of development and approval times by FDA therapeutic 
rating. However, the only prior analysis of development costs by therapeutic rating was in 
our previous study. We found higher mean clinical phase costs for more highly rated drugs. 
The results for this sample also show higher costs. Mean clinical period out-of-pocket cost 
for approved drugs with a P rating was US$ 207 million, compared to US$ 155 million for 
drugs that had received the S rating. 

The differential was less for capitalized costs. Mean clinical period capitalized cost was 
US$ 273 million for drugs with a P rating and US$ 236 million for those with the S rating. 
In both cases, the confidence intervals for P and S rated drugs overlap. However, given the 
substantial variability in drug development costs and the fact that the number of compounds 
in each category was small ( 1 0 drugs with a P rating and 1 4 with an S rating), this outcome is 
not surprising. However, it is plausible that, on average, testing a priority-rated drug breaks 
more new scientific ground and so is costlier, as firms must learn through experience. It may 
also be the case that firms have the incentive to do more wide-ranging and costly testing on 
drugs that have the potential to be both clinically and commercially significant. Our results 
can then be viewed as supportive, but not conclusive, evidence of higher costs for drugs 
with higher therapeutic significance ratings. 

7.2. Cost of post-approval R&D 

Our main objective was to estimate pre-approval R&D costs. However, our pre-approval 
estimates together with other pharmaceutical industry data regarding the drug develop- 
ment process allowed us to construct an estimate of the cost of post-approval R&D, and 
thereby obtain an estimate of average total R&D cost per new drug covering the entire 



30 The process is intended to provide direction for internal prioritization of marketing approval reviews by the 
FDA. The Prescription Drug User Fee Act of 1992 and its reauthorization in 1997 include performance goals for 
the FDA that are defined in terms of the therapeutic ratings. 

31 In late 1992 the FDA switched from a three-tiered rating system (A, B, C) to the current two-tiered system (P, 
S). Drugs that were rated A were judged to represent a significant gain over existing therapy, those rated B were 
judged to represent a moderate gain over existing therapy, and those rated C were judged to represent little or no 
gain over existing therapy. Our sample includes drugs that were rated under the old system. We assigned drugs 
that had received an A or B rating to the P category, and drugs that had received a C rating to the S category under 
the current system. 
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Fig. 4. Out-of-pocket and capitalized total cost per approved new drug for new drugs and for improvements to 
existing drugs. 

development and marketing life-cycle. The aggregate annual expenditure data that we col- 
lected for the cost survey firms show that these firms spent approximately three-quarters of 
their prescription pharmaceutical R&D expenditures on self-originated new drugs, 10% on 
investigational drugs that are licensed-in or otherwise acquired, and 15% on improvements 
to drugs that have already been approved. 

We cannot, however, use the percentage of aggregate R&D expenditures spent on post- 
approval R&D on a current basis and apply it to our pre-approval cost estimate to obtain an 
estimate of the cost of post-approval R&D per approved drug. The reason is that pre-approval 
costs occur years before post-approval costs. We may use our aggregate annual firm R&D 
data, but we must build in a reasonable lag structure. Our methodology for doing so is dis- 
cussed in detail in an appendix that is available from the authors upon request (Appendix A). 

We used a 1 0-year lag for the aggregate data (approximate time between median pre- 
approval development costs and median post-approval costs), assumed that post-approval 
R&D cost per approval is the same, on average, for licensed-in and self-originated drugs, 
and determined the percentage of approvals for the cost survey firms that are self-originated 
to estimate the ratio of post-approval R&D cost per approved drug to pre-approval cost 
per approved drug. The data indicated that this share was 34.8%. Thus, we estimated the 
out-of-pocket cost per approved drug for post-approval R&D to be US$ 140 million (Fig. 4). 
Since these costs occur after approval and we are capitalizing costs to the point of market- 
ing approval, our discounted cost estimate is lower (US$ 95 million). Thus, out-of-pocket 
cost per approved drug for post-approval R&D is 25.8% of total R&D cost (pre- and 
post-approval), while capitalized cost for post-approval R&D is 10.6% of total cost. 

7.3. Tax analysis 

The cost estimates that are presented here are pre-tax. As noted above, OTA (1993) used 
the basic data and methodology from our previous study in their report, but the OTA also 
reported an after-tax figure determined by subtracting a percentage of pre-tax capitalized 
cost. The percentage was an assumed average effective corporate income tax rate for the 
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period. Hence, a straightforward calculation can be made to use our R&D cost estimates 
as inputs in after-tax analyses of R&D rates of return (OTA, 1993; Grabowski and Vernon, 
1 994). However, some have suggested that an after-income tax figure is the relevant measure 
of pharmaceutical industry R&D cost (Public Citizen, 2001). 

As a stand-alone estimate for R&D cost, we find such a figure to be inadequate for 
our purposes and potentially misleading. First, we are primarily interested in trends in 
private sector resource costs associated with getting a new drug to regulatory marketing 
approval. Tax rates and tax structures can change over time, so trends in resource costs can 
be masked by after-tax figures. Second, even if the objective is to measure the effective cost 
to companies, that cost is not properly measured by subtracting the corporate income tax 
deduction for R&D from the resource cost estimate. It can also be misleading, as it may 
suggest that government is subsidizing corporate R&D by the amount of the deduction. 
The corporate income tax is intended to be a tax on profits. Deductions for R&D and other 
business costs are the means used to approximate the appropriate base for the tax (revenues 
minus costs). Thus, cost deductions on corporate income tax statements cannot be properly 
viewed as tax breaks. 

The only potential tax advantage with respect to administration of the corporate income 
tax involves the timing of tax payments. R&D is an investment, but firms are allowed to 
deduct R&D costs (excluding plant and equipment) as current expenses in lieu of depreci- 
ating these investment costs over time. Nevertheless, the value of this timing effect should 
be significantly less than the total deduction. 32 The accounting informational requirements 
needed to appropriately depreciate an intangible asset such as R&D are so formidable that 
expensing of R&D is allowed under accounting guidelines. The true economic depreciation 
schedule likely varies significantly by industry, by firms within an industry, and by project 
within a firm. Thus, the practice of allowing what is in effect a 100% depreciation rate in 
the first year can be viewed as a second-best solution for an otherwise intractable issue. 

A portion of the US tax code that is intended to serve as a stimulus to innovation by 
effectively subsidizing R&D is the Research and Experimentation (R&E) tax credit. The 
R&E tax credit was not relevant to a significant degree to the study period for our previous 
analysis (DiMasi et al., 1991). However, it is almost fully applicable to the study period 
for the current analysis. The credit is generally determined as a percentage of the excess 
of qualified R&D expenditures in a year over a base amount. It is difficult to adequately 
assess the quantitative impact of this tax policy. Over the history of the implementation of 
the R&E tax credit, the percentage credited has changed, as has the method for determining 



32 In theory, optimal administration of the tax would involve depreciating all forms of intangible capital at 
economically appropriate rates. However, tax savings relative to the theoretical optimum should be measured in a 
tax revenue-neutral context. If intangible capital were depreciated rather than expensed, then the present value of 
tax revenues would increase. To keep revenues constant, the tax rate would have to be lowered. If all industries were 
identical with respect to the degrees to which they utilized intangible capital of all types, then tax burdens would 
not be any different in the alternative state (abstracting from any induced secondary effects on the distribution of 
industry allocations between tangible and intangible capital or between labor and capital). The pharmaceutical 
industry, however, is almost certainly above-average in terms of investment in intangible capital (Clarkson, 1977). 
If the optimal state is attainable at reasonable cost, the tax savings to the pharmaceutical industry, then, is not the 
difference in the present values of its tax burden as between the current state and the optimum at the current tax 
rate, but something less that depends on the extent to which the pharmaceutical industry is above-average with 
regard to investment in intangible capital. 
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the base amount. 33 It seems unlikely, though, that the credit has had a substantial economic 
impact on large multinational pharmaceutical firms. 34 

Since 1983 an orphan drug tax credit has also been available to manufacturers for clinical 
trial expenses related to the development of drugs for orphan indications (fewer than 200,000 
patients afflicted in the United States or where it can be demonstrated that development is 
not profitable). However, for a number of reasons the empirical significance of this credit 
for the type of firm surveyed for this study is likely to be very small. 35 Analysis of data 
provided in a Congressional Research Service (CRS) report indicates that orphan drug tax 
credits amount to a fraction of a percent of pharmaceutical industry R&D expenditures 
(Guenther, 1999). 36 



33 In the early implementation years the credit percentage was 25%, but that was lowered to 20% in 1987. The 
base amount had been an average of research expenditures that met certain criteria for the three previous tax years. 
In most instances it now essentially involves applying an historical R&D-sales ratio (any 5 years from 1983 to 
1988) to the average of gross receipts for the previous 4 tax years. The credit can be applied only to the excess 
of current "qualified research expenses" over the base amount. A variety of R&D expenditures are excluded from 
consideration. For example, management expenses other than first-line supervision of those directly engaged in 
research activity, some computer software development costs, and 35% of research expenses contracted out to 
for-profit firms are not counted. The credit also does not apply to research conducted outside the United States, 
Puerto Rico, or any possession of the United States. In addition, firms will typically elect to reduce the allowed 
credit by the maximum corporate income tax rate (currently 35%). If they do not, then they must reduce the 
research expenses that they deducted on their corporate income tax statements by the amount of the credit. 

34 Many firms do not separately report R&E tax credits in their published financial data. We did find R&E 
credits reported in the public financial statements of seven large pharmaceutical firms for each year from 1999 
to 2001 (GlaxoSmithKJine, Johnson & Johnson, Lilly, Pfizer, Pharmacia, Schering-Plough, and Takeda) and for 
2001 for American Home Products (now Wyeth). We compared the credit amounts to the firms' reported R&D 
expenses. R&E credits as a percentage of R&D expenditures varied somewhat by firm and year (0-5.2%). Overall, 
the tax credits amounted to 2.0% of R&D expenditures. Adding Merck, which reported on a broader category 
(General Business Credits), increased the share only to 2.1%. One might argue that prescription pharmaceutical 
R&D could contribute more to the accumulation of R&E tax credits than is indicated by these data. This might be 
so if prescription pharmaceutical R&D expenditures grow more rapidly than the firms' other R&D expenditures 
(this effect would be mitigated, though, in the long-run if pharmaceutical sales also increase at a rate that is 
greater than for the firms' other businesses). We do not know if this has been the case. However, even if it has, 
that impact could be more than reversed if firms have made greater use of outsourcing in pharmaceutical than in 
non-pharmaceutical R&D. By all accounts, pharmaceutical firms have contracted out drug development activities 
at a rapidly growing rate over our study period, and the share of pharmaceutical R&D expenditures currently 
accounted for by outsourcing is substantial. As noted above, a significant share of outsourced R&D is excluded 
from the tax credit calculations. 

35 Unless it can be demonstrated that it is necessary to go outside the United States to find patients, the credit 
(50% of qualified clinical trial expenses) is not available for foreign trial costs. It is also cannot be applied to 
clinical testing on any non-orphan indications for a compound with an orphan drug designation. In addition, the 
vast majority of the manufacturers with products that have received orphan drug designations are biotech firms or 
small niche pharmaceutical firms (see http://www.fda.gov/orphan/designat/list.htm). For development as a whole, 
it is highly likely therefore that the share of R&D expenditures for which the orphan drug credit was applicable 
for traditional large multinational pharmaceutical firms is quite low. 

36 The report includes data on both orphan drug tax credits and taxable income for the pharmaceutical industry for 
1 990-1 994. The CRS also noted in its report that 20.3% of US pharmaceutical industry domestic sales and exports 
were spent on R&D in 1997. Applying this R&D-sales ratio to the data on taxable income suggests that orphan 
drug credits amounted to 0.3% of R&D expenditures. This is a conservative estimate for large pharmaceutical 
firms since taxable income is determined by deducting business expenses from sales, and since, as noted above, 
biotechnology and small pharmaceutical firms obtain a disproportionate share of the credits. 



176 



J.A. DiMasi et al. /Journal of Health Economics 22 (2003) 151-185 



8. Validation 

In their 1993 report, the OTA reviewed the literature on pharmaceutical R&D costs. In 
addition to critiquing the methodologies used in these studies, the review addressed evidence 
on the reasonableness of the studies, particularly the DiMasi et al. (1991) study. The OTA 
concluded that, "the estimates by DiMasi and colleagues of the cash outlays required to bring 
a new drug to market and the time profile of those costs provide a reasonably accurate picture 
of the mean R&D cash outlays for NCEs first tested in humans between 1970 and 1982" 
(OTA, 1993, p. 66). The OTA provided varied data and analyses to corroborate the results 
in DiMasi et al. (1991). We corroborate the basic cost results in this study by examining 
the representativeness of our sample firms and by analyzing various independently derived 
results and data about the industry and the drug development process. We pay particular 
attention to data that corroborate the growth in costs between the previous study and the 
current one. 

8.L Internal validation 

The Tufts CSDD database of investigational compounds, from which our sample was 
selected, contains data on the vast majority of new drugs developed in the United States 
(DiMasi, 2001a). The distribution of investigational drugs across therapeutic classes for our 
10 survey firms is very close to the distribution for all drugs in the database. We examined 
the data for eight specific therapeutic classes and one miscellaneous class for drugs in the 
database that met study inclusion criteria. There are 530 compounds in the database that meet 
these criteria and for which a therapeutic class could be identified (272 of these compounds 
belong to the 10 cost survey firms). The largest difference in share for a specific class 
between all firms in the database and the cost survey firms was 1 .5%. 37 Using a chi-squared 
goodness-of-fit test comparing the therapeutic class distributions for the cost survey firms 
and the other firms in the database, we found no statistically significant difference for the 
class shares (x 2 = 5.01, d.f. = 9). 38 

Based on publicly available data, we also found that pharmaceutical R&D expenditure 
growth rates for the survey firms as a whole were similar to the reported growth rates for all 
PhRMA member firms. For example, the annual growth rate in real pharmaceutical R&D 
expenditures for the survey firms 39 from 1995 to 2000 is 1 1.3%, compared to 1 1.0% for 
PhRMA member firms over the same period. 40 

37 The population shares for the analgesic/anesthetic, antiinfective, antineoplastic, cardiovascular, central nervous 
system, endocrine, gastrointestinal, immunologic, miscellaneous, and respiratory classes are 9.1, 12.8, 9.4, 23.2, 
17.9, 7.0, 2.1, 3.0, 9.4, and 6.0%, respectively. The corresponding shares for the cost survey firms are 9.6, 14.3, 
8.1, 22.8, 19.1, 7.4, 2.2, 3.3, 7.7, and 5.5%, respectively. 

38 The estimated clinical success rate for all firms in this dataset (21.5%) is also very close to the estimated 
success rate for the 10 firms using the same inclusion criteria (22.2%). 

39 The data are for nine of the 10 firms. We did not find pharmaceutical R&D data for one of the firms, but 
this firm has a relatively small pharmaceutical subsidiary whose inclusion would not materially affect the results. 
The data were taken from Scrip's Pharmaceutical Company League Tables (various years) and company annual 
reports. 

40 The annual growth rate for 1 995-1 999 was slightly lower for the survey firms compared to all PhRMA member 
firms (1 1 .5% versus 1 1 .8%). 



J A. DiMasi et al /Journal of Health Economics 22 (2003) 151-185 



177 



8.2. External validation 

Publicly available data that were collected independently can be examined to deter- 
mine the extent to which they are consistent with our results in terms of levels or rates of 
change. Specifically, we examined independent information on clinical trial sizes, measures 
of clinical trial complexity, and published trade association data on R&D employment and 
expenditures. 

8.2.1. Clinical trial sizes and complexity 

Several groups have compiled data on clinical trial sizes for new molecular entities 
approved in the United States for periods that range from the late 1970s to 2001 (BCG, 1993; 
OTA, 1993; Peck, 1997; PAREXEL, 2002). 41 Averaging the BCG results for 1981-1984 
and 1985-1988 (2277) and comparing them to average of the Peck (1997) and PAREXEL 
(2002) results for 1994-1995 and 1998-2001 (5603) yields an annual growth rate in clinical 
trial sizes of 7.47% per year. 42 We may approximate the increases in cost per subject over 
time by examining the excess of medical care inflation over general price inflation. The 
medical care component of the CPI increased at an average annual rate of 6.73% from 1984 
to 1997, while general price inflation (applying the price index used to deflate costs for this 
study) rose at an annual rate of 3.06% over the same period. Thus, other things being equal, 
these results suggest an increase of 1 1 .4% per year in clinical trial costs. This compares to 
our finding of an 1 1.8% annual growth rate in out-of-pocket clinical period cost between 
DiMasi et al. (1991) and the current study. 

These separate estimations need not be in perfect agreement because our clinical cost 
figures include costs not directly related to the number of clinical trial subjects (infrastructure 
costs, fixed costs related to production of clinical trial supplies, animal testing during the 
clinical period, etc.). In addition, there could be some economies of scale in clinical testing 
that would result in a somewhat lower growth in cost per subject. However, data compiled 
by DataEdge, LLC (PAREXEL, 2002, p. 96) indicate that the complexity of clinical trials 



41 Each of these sources obtained data for a sample of the US approvals during specific periods. The BCG found 
the mean number of subjects included in NDAs to be 1576 for 1977-1980, 1321 for 1981-1984, and 3233 for 
1985-1 988. OTA ( 1 993) compared clinical trial sizes for NDAs for three therapeutic categories (antihypertensives, 
antimicrobials, and nonsteroidal antiinflammatories) over two periods. In aggregate, it found the mean number 
of subjects to be 2019 for 1978-1983 approvals (n = 28) and 3128 for 1986-1990 approvals {n = 25). Peck 
(1997) found the mean number of subjects to be 5507 for 12 of 50 1994-1995 approvals. PAREXEL (2002) 
has examined the number of subjects in NDAs for 55% of the new molecular entities approved by the FDA 
in each year from 1998 to 2001. For the period as whole, the mean number of subjects is 5621 (n = 64). 
The latter two averages are similar to what we have found as the mean number of subjects across all three 
clinical phases for the investigational drugs in our cost survey (5303). CMR (2000) found the mean number of 
subjects to be 4478 for 23 marketing approval applications submitted from 1995 to 2000. However, only nine 
of the submissions were to the FDA, with the remainder submitted to European Union and Japanese regulatory 
authorities. Since pre-approval costs are measured here up to the point of US regulatory approval, we use the US- 
based data. 

42 These groupings were chosen so that the mean approval years were 1984 and 1997 (the average approval 
years for the DiMasi et al. (1 991) and the current cost samples). The difference in the two periods was taken to be 
12.5 years. For the early period, we prefer the BCG data to the OTA data, since the OTA data apply to only three 
therapeutic categories that likely tend, in aggregate, to have above-average clinical trial sizes. 
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has increased significantly in recent years. Their index of clinical trial complexity 43 for 
phases I— III increased at an annual rate of 4.8% per year from 1992 to 2000. An increase 
in clinical trial complexity will contribute to even higher growth rates for clinical costs. 44 

8.2.2. Growth in industry R&D employment costs 

Despite rapid growth in outsourcing of R&D activities over the last few decades, phar- 
maceutical firms have significantly expanded the number of their own employees devoted 
to the R&D function. In its industry profile and annual survey reports over various years, 
PhRMA has provided annual information on the R&D employment of its member firms. 
From 1980 to 2000, total R&D employment increased at a compound annual rate of 5.4%, 
with scientific and professional staff increasing at a 7.4% annual rate 45 

We adjusted National Science Foundation (NSF) data on median annual salaries for 
full-time employed biological scientists with doctorates working in for-profit life sciences 
industries from 1993 to 1999 for inflation (GDP Implicit Price Deflator) 46 Real salaries 
increased at a rate of 1.75% per year over this period. The OTA presented similar data 
for every 2 years from 1973 to 1989 (OTA, 1993; pp. 62-63). The real growth rate in 
median annual salaries for biological scientists with doctorates employed in business or 
industry from 1981 to 1989 was 1.77%. Applying a real growth rate of 1.76% per year for 
compensation to a growth rate of 7.4% per year in employment yields a growth rate of 9.3% 
per year for labor costs. This is moderately higher than the growth rate of 7.6% per year 
that we found for total out-of-pocket cost per approved drug between our previous study 
and the current one. 47 Thus, some labor costs have grown fairly rapidly. Most of the growth 
in labor costs, though, has been due to increasing the labor force devoted to R&D, rather 
than to increases in real wages. 



43 The index is based on the mean number of medical procedures to be applied to patients in clinical trial protocols. 
Some of these procedures will be covered by insurance, but the index should provide at least a rough indicator of 
the degree to which the clinical trial process is increasing in complexity. 

44 DataEdge has also compiled information on certain clinical trial costs (investigator fees and central laboratory 
costs). Changes in cost due to increases in clinical trial complexity will be at least partially reflected in these data. 
PAREXEL (2002) reports their index of mean costs per subject across all clinical phases (I— IV) for each year 
from 1996 to 2000. The index increased at an average annual real rate of 5.33% over this period. Combining this 
growth rate with the above growth rate for clinical trial sizes suggests a 1 3. 1% average annual real rate of increase 
in clinical trial costs. Piecing together the index values for years reported in earlier editions of PAREXEL (2002) 
yields a 3.54% real growth rate for 1993-2000. This would imply an 1 1.2% average annual real growth rate in 
clinical trial costs. 

45 Over our study period, highly trained personnel have comprised an increasingly large component of the 
pharmaceutical industry in-house R&D labor force. The share of total R&D personnel for the scientific and 
professional category in the PhRMA data increased from 56.3% in 1980 to 81.8% in 2000. 

46 The data were compiled for 1993, 1995, 1997, and 1999 by the NSF through surveys of doctoral scientists 
and engineers in the United States (National Science Foundation, various years). The NSF used a new survey 
instrument for 1993 and later. Data for every 2 years from 1973 to 1989 used somewhat different occupational 
definitions. Thus, these data may not be strictly comparable to the data for 1993 and beyond. Data were not 
available for 1991. 

47 The NSF survey data for 1993-1999 show a real increase of 1 .2% per year in median annual salaries across all 
degrees for biological scientists working in the for-profit life sciences industries. Applying this growth rate to the 
growth rate of 5.4% for all pharmaceutical industry R&D personnel yields an increase of 6.7% per year in labor 
costs. 
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8.2.3. Cost estimates from published industry R&D expenditures 

PhRMA has gathered information on aggregate industry R&D spending for decades. 
The resultant R&D expenditure time series can be linked to data on new drug approvals 
to develop rough estimates of out-of-pocket pharmaceutical R&D costs. As noted above, 
linking current expenditures to current approvals is an inadequate approach. Our estimated 
time profile for a representative drug and the pattern of costs over that timeline determined for 
this study can be used to construct a lag structure for aggregate expenditures and approvals. 48 

There are two complications regarding the PhRMA data that must be addressed before 
we can validate our estimates. One is that while PhRMA has traditionally disaggregated 
its reported R&D expenditure data into expenditures on new drugs and expenditures on 
improvements to existing drugs, it has not gathered information on how expenditures on 
new drugs can be further decomposed into expenditures on self-originated and on licensed-in 
new drugs. Our R&D cost estimates are for self-originated drugs, and a substantial portion 
of the R&D expenditures on licensed-in drugs are likely missing from the PhRMA data. 49 
Thus, we need to associate lagged industry expenditures on self-originated new drugs with 
self-originated new drug approvals. The second complication is that, with the exception of 
1 year, PhRMA has gathered information on the domestic expenditures of all its firms, but 
the foreign expenditures of only its US-owned members. Our method for dealing with these 
complications is described in detail in an appendix available from the authors upon request 
(Appendix B). 

We related estimated lagged PhRMA member firm R&D expenditures on self-originated 
new drugs from 1978 to 1998 to the number of self-originated new drug approvals by 
PhRMA member firms from 1 990 to 2000. The lag structure follows the phase time-expendi- 
ture profile implied by our data, with weights attached to aggregate expenditures over a 
2-12 year period. The ratio of total lagged self-originated R&D expenditures to the total 
number of self-originated approvals yields an estimate of the out-of-pocket cost of new drug 



48 PhRMA also publishes a breakdown of annual R&D expenditures of its member firms by function (PhRMA, 
2001). The share for the category "Clinical Evaluation: Phases I— III" in 1999 is 29.1%. This share cannot be 
compared to the clinical period share of total out-of-pocket cost per approved drug implied by our estimates for at 
least three reasons. First, clinical period costs in a given year are linked to pre-human R&D expenditures in past 
years, and the pharmaceutical R&D expenditure series shows substantial growth. Thus, shares based on current 
year expenditures will significantly understate the clinical portion. Second, portions or all of some categories are 
for expenditures on post-approval R&D and should be deducted from the base before a pre-approval clinical share 
is computed. For example, given their definitions, the categories for "Clinical Evaluation: Phase IV (1 1 .7%)" and 
"Process Development for Manufacturing and Quality Control (8.3%)" would likely have to be taken entirely out 
of the base. In addition, portions of other categories also likely are associated with post-approval R&D. Third, our 
notion of clinical period costs extends beyond direct patient costs and includes fixed infrastructure costs and other 
costs incurred during the clinical period. The categories "Toxicology and Safety Testing (4.5%)," Pharmaceutical 
Dosage Formulation and Stability Testing (7.3%)," "Regulatory: IND and NDA (4. 1 %) ," "Bioavailability ( 1 .8%)," 
and "Other (9.0%)" would each have to be decomposed into shares for pre-human R&D, pre-approval clinical 
period R&D, and post-approval R&D. With a reasonable pre-human/clinical lag structure, it is possible to choose 
an allocation of the three periods for these functional categories that results in a clinical period share of pre-approval 
R&D expenditures that equals our estimated cost share. However, we are not aware of any data that allows one to 
make these allocations credibly. Thus, we concluded that the PhRMA data on functional categories could not be 
used as an external check on our results. 

49 The PhRMA data apply to member firms. Not every pharmaceutical firm (particularly foreign firms) and few 
biotechnology firms are members of the organization. 
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development. 50 We calculated a range for this ratio by using reported domestic industry 
R&D expenditures for a lower bound and domestic plus foreign (inclusive of estimates 
for foreign-owned firms) industry R&D expenditures as an upper bound. The result is a 
range of US$ 354-558 million for out-of-pocket cost per approved new drug (inclusive of 
failures). Our out-of-pocket cost estimate of US$ 403 million per approved drug calculated 
from our survey data falls within this range. Capitalizing the aggregate expenditure data 
using our phase-expenditure time profile yields a range of US$ 650-1023 million, which 
encompasses our total capitalized cost estimate of US$ 802 million. 

We also conducted a check similar to what the OTA had done in its report (OTA, 1993, 
pp. 61-62). In theory, under our average development and approval time profile described 
above, all industry self-originated new drug R&D expenditures in 1988 would be associated 
with new drug approvals from 1990 to 2000. If each self-originated new drug approval from 
1990 to 2000 by a PhRMA member firm is assumed to cost US$ 403 million, then we can 
use the yearly time-expenditure weights noted above to estimate PhRMA member firm 
total self-originated R&D expenditures in 1988. Doing so yields US$ 6176 million in 2000 
dollars. This value fits within our range for self-originated new drug R&D expenditures 
estimated from the PhRMA data (US$ 4942-7777 million in 2000 dollars). 



9. Conclusions 

The cost of developing new drugs is a topic that has long engendered considerable interest. 
The interest has intensified recently as firms have become increasingly concerned about 
improving productivity in a period of consolidation and cost containment pressures in the 
marketplace, and industry critics question industry statements about the level of R&D 
costs and the impact that price regulation would have on R&D (Public Citizen, 200 1). 51 
We have undertaken the only comprehensive project-based analysis of the costs of drug 
development since our previous study (DiMasi et al., 1991). In the last study we estimated 
average R&D cost to be US$ 231 million in 1987 dollars. For our updated analysis, we 
estimated that total R&D cost per new drug is US$ 802 million in 2000 dollars. Our results 
were validated in an number of ways through analyses of independently derived published 
data on the pharmaceutical industry. Including an estimate of the cost per approved new drug 
for R&D conducted after approval increases total R&D cost to nearly US$ 900 million. Our 
pre-approval estimate represents a two and one-half-fold increase in real capitalized costs. 
On an annualized basis, the growth rate in inflation-adjusted cost was 7.6% for out-of-pocket 
expenditures and 7.4% for capitalized costs. 

Roughly speaking, the current study covers R&D costs that yielded approvals, for the 
most part, during the 1990s. The previous study (DiMasi et al., 1991) generally involved 



50 We believe that aggregating over the expenditure and approval periods is superior to using an average of yearly 
ratios. Year-to-year ratios are highly variable since they are very sensitive to the denominator value (number of 
self-originated new drug approvals) for the year. 

51 Pre-approval R&D expenditures are sunk costs at the time a pricing decision has to be made. Thus, they should 
not affect price setting in an unregulated market. However, to the extent that high past R&D costs predict high 
future R&D costs, then anticipated or realized stringent price regulation can significantly reduce incentives to 
innovate and thereby negatively impact future drug development. 
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R&D for 1980s approvals, and the first study in this series (Hansen, 1979) was mainly 
relevant to 1970s approvals. While the compound annual growth rates in out-of-pocket 
costs between successive studies were similar (7.0% per year between the first two studies 
and 7.6% per year between the last two), the rates of increase for the two major R&D phases 
were quite different. Although both preclinical and clinical period costs increased in real 
terms in this study, the rate of increase for the preclinical period was less than one-third that 
for the first two studies, while the growth rate for clinical costs was nearly twice as high for 
the two most recent studies. 

Our data do not allow us to test hypotheses about factors that affect how costs change 
over time, but some conjectures can be made. For example, over the periods analyzed the 
pharmaceutical industry has increasingly focused on developing treatments for chronic and 
degenerative diseases or conditions associated with those diseases. 52 Therapies for such 
conditions are generally more costly to test, as they typically require more complex patient 
care and monitoring, longer periods for effects to be observed, or larger trial sizes to establish 
their efficacy. 

When the study periods analyzed for the previous study and the current one are com- 
pared, one observes that the number of new drugs approved increased over time, as did the 
number of drugs investigated. This can be associated with patient recruitment that is more 
time-consuming and costlier. 

Finally, the development of more stringent cost containment strategies in the United 
States and abroad such as tiered formularies and the demand for cost-effectiveness results 
may have led firms to test their drugs more often against competitor products already on the 
market (F-D-C Reports, 1999). This will generally be costlier than testing against placebo; 
the trials will likely need to be more highly powered (i.e. clinical trial sizes will have to be 
higher) to establish a statistical difference. 

These factors help explain the growth in clinical period costs. Preclinical (discovery 
and preclinical development) costs also grew in real terms, but much more slowly than in 
the past. The widespread use of discovery technologies, such as combinatorial chemistry 
techniques and high-throughput screening, during the current study period may have created 
enough efficiency gains to slow down the growth of preclinical costs. 

The cost growth rates that we have observed are substantial. There is no guarantee that 
they will continue at these levels, but we can determine where costs would end up if they 
did. The average approval date for our sample was in 1997. Assuming the same growth 
rates for out-of-pocket and capitalized costs, then the out-of-pocket pre-approval cost per 
approved drug for R&D relevant to approvals in 2001 would be US$ 540 million, while 
capitalized pre-approval cost would be US$ 1 . 1 billion. If growth rates were maintained 
and R&D was initiated in 2001 with approvals obtained 12 years later, then pre-approval 
out-of-pocket cost would rise to US$ 970 million and pre-approval capitalized cost would 
rise to US$ 1.9 billion. 

A number of technical factors can work to alter the growth pattern for future R&D 
costs. We observed improved clinical phase attrition rates for the current study. If firms 



52 We have in mind a broader concept than chronic use drugs. The conditions treated may require drugs that are 
used on a short-term, medium-term, or intermittent basis. These conditions may result from the natural course of 
a chronic disease or they may occur as side effects from direct treatment of such complex diseases. 
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can further improve their performance in terminating research early for compounds that 
will not make it to approval, then this will help lower out-of-pocket and capitalized costs. 
Reductions in development times, other things being equal, would also lower capitalized 
costs. Some recent evidence on clinical development times suggest a shortened process, at 
least in the United States (Kaitin and DiMasi, 2000; DiMasi, 2001a), but it is too soon to 
conclude that we are observing a new trend. Finally, emerging discovery and development 
technologies may have profound effects on R&D productivity. Industry analysts that have 
recently examined the impact that genomics and other new technologies may have on the 
R&D process have suggested that as pharmaceutical firms increasingly embrace the new 
approaches, R&D costs may actually rise significantly in the short run (Pharma Marketletter, 
2001; Tollman et al., 2001). The major reason is that the new technologies may generate 
many targets that are currently not well understood. Eventually, though, they argue that the 
science knowledge base will expand sufficiently so that efficiencies will be realized. 

Analyses of private sector R&D costs provide a crucial input to policy-oriented studies. 
For example, R&D cost estimates can be utilized in studies that aim to measure the ex-post 
profitability of new drug development for a given period. This is a timely issue given 
recent media attention on R&D productivity issues and problems in the R&D pipelines 
of many leading firms (Pollack, 2002). Results from our prior studies have in fact been 
used in analyses of the rate of return to pharmaceutical R&D (Grabowski and Vernon, 
1990; OTA, 1993; Grabowski and Vernon, 1994). 53 These studies of the profitability of 
new drug development have not found evidence of significant and sustained excess profits. 
The estimated internal rates of return are quite close to the cost-of-capital. The much higher 
R&D cost estimates for this study raise a question about the recent profit experience of the 
pharmaceutical industry. However, Grabowski and Vernon (2000) found substantial growth 
in pharmaceutical sales for 1990s drug cohorts. A new study (Grabowski et al., 2002) on 
pharmaceutical profitability using some of the cost results in this study and recent sales 
data is qualitatively consistent with the outcomes of the earlier profitability studies (i.e. the 
internal rate of return is close to the industry cost-of-capital). 

Data on R&D costs can also be helpful in analyzing the impact on R&D returns from 
policy changes that affect the intellectual property protection system, drug development 
times, or FDA approval times, and therefore influence private incentives to innovate. The 
Congressional Budget Office (CBO), for example, examined the net effect on pharmaceuti- 
cal industry returns that the Drug Price Competition Act of 1984 had from simultaneously 
reducing the cost of generic entry and increasing effective patent lifetimes (CBO, 1998). 
Simulations of proposed policy changes for these and other variables that affect the costs 
of and returns to pharmaceutical R&D can similarly be conducted using our new estimates. 

The relationship between pharmaceutical industry profitability and investment in R&D 
has recently been examined in Scherer (200 1 ). The author found a high degree of correlation 
between the deviations from trend for the time series on pharmaceutical industry R&D 
expenditures and on gross margins, indicating that R&D outlays are affected significantly 



53 As noted above, tax issues are explicitly considered in such studies. The corporate income tax, however, plays 
a very limited role in such analyses. The reason is that the tax essentially enters symmetrically in the analysis 
(applied to revenues as well as costs), and so the impact on the internal rate of return is minimal. The net present 
value of profits, though, is lower because of the tax. 
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by changes in profitability. The growth rate for gross margins for recent years was also 
substantially lower than the growth rate for R&D outlays, leading to the suggestion that 
R&D growth rates could lessen in the future. If that were to happen, one might ask what 
would happen to R&D costs. This would depend on the outcome of internal rate of return 
analyses by firms on marginal projects. 54 The ultimate impact on future costs, however, 
will also depend on whether and to what degree currently unforeseen biomedical advances 
that expand scientific opportunities will be realized. 

Finally, our results indicate that variability in drug development costs has declined some- 
what but is still substantial. For an earlier period, DiMasi et al. (1995a) found varying 
average clinical period costs for a number of major therapeutic classes. We will examine 
costs by therapeutic category in future research. For that same earlier period, DiMasi et al. 
(1995b) also found that average R&D costs tended to decrease with firm size. The structure 
of the traditional pharmaceutical industry appears to have evolved somewhat since then. 
Examining new drug output levels by firm, DiMasi (2000) found both a long-term decon- 
centration trend for the research-based pharmaceutical industry and substantial new entry 
during the 1990s with respect to traditional small molecule output. The R&D cost data for 
this study can be used in further analyses of R&D productivity at the firm level in future 
research. 
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Protein tyrosine phosphatases OTPs) control signal transduction pathways 
and have recently emerged as potential drug targets. Inhibition of individ- 
ual PTPs can result in the activation of therapeutically relevant kinase cas- 
cades- This is particularly useful in cases where disease is associated with 
hormonal resistance, such as insensitiv'rty to insulin or leptia Currently, 
PTP1B is being investigated by a number of companies as a promising target 
for leptin/insulin mimetics and in the treatment of diabetes and obesity. 
Since all 90-100 PTPs have been identified in the human genome, the chal- 
lenge now is to identify the function of these enzymes and the therapeutic 
indications that may exist for specific PTP inhibitors. 
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Nearly all mtraceUular signalling is controlled by protein phosphorylation/depho^ 
phorylation events. The role of kinases as phosphorylatiiig enzymes -was appreciated 
very early on, while the active role of their counterparts, the phospharases, is now 
also rapidly gaining recognition. The dynamic character of intracellular phosphor- 
ylation and dcphosphorylarion reactions is best appreciated when cells are treated 
wim generic phosphatase inhibitors, such as vanadate or arsenate, which results in 
the rapid and massive phosphorylation of many mtraceliular targets and which has 
plciotropic physiological effects [i]. One may anticipate from this observation that 
inhibition of individual phosphatases would stimulate individual, specific pathways; 
several examples of this concept are provided below. The authors will focus on pro- 
tein tyrosine phosphatases (PTPs), the largest group of phosphatases, as potential 
drug targets and summarise current knowledge of FTP IB, which is the paradigm 
PTP target for insulin- and leptm-rnimetic drugs. 

il Protein tyrosine phosphatase gene families 

Tyrosine phosphatases can be subdivided into 'classical PTPs , which arc phosphoty- 
rosine specific and ^dual^pecific PTPs' (DS-PTPs), which also dephosphorylate 
serine- and threoninc-phosphatc residues. Classical and DS-PTPs arc characterised 
by the PTP signature motif ([H/V]C(X),£[S/Tj). Weaker sequence similarity 
extends to an -250 amino add domain [231. These PTPs form the bulk of the tyro- 
sine phosphatases but the signature motif alone is also found in other, smaller sub- 
families whose overall sequence similarity with these two main classes is not 
obvious. These include Cdc25A-Q LMW-PTPs (low molecular weigm-PTPs), the 
MTM (myombiJarinVrelated and the tensin (TPTPEVlikc phosphatases. The 
authors have explored the Celcra and NCBI (National Center for Biotechnology 
Information) draft human genome sequences for tyrosine phosphatasc-like genes 
based on sequence similarities to each of these subfamilies, based on the presence of 
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Table 1. The number of FTP family genes predicted from Celera and NCBhgenerated human genome sequences, 
based on the various PTP subclasses. 




Celera 




NCB 




Profile 


Genes 


Pseudoqen&s 


Genes 


Pseudocjenes 


'Classkd'PTPs 


39 


13 


35 


12 


DS-FTPs 


33 


5 


31 


4 


Cdc25^c 


3 


0 


3 


0 


LMW-PTP5 


1 


3 


1 


2 


MTM-like 


14 


4 


14 


4 


Tensin (TFTPE)-like 


5 


7 


5 


7 


Total 


95 


32 


89 


29 



DS-PTP: Dual-specific FTP; LMW-PTP: Low molecular weight PTP; MTM: Myotubularin; MCBI: National Center f Of Biotechnology information; PTP: Protein tyrosine 
phosphatase; TPTPE: Tensin-lilce PTP. 



die signature motif and intron-exon predictions (Tabic l). 
For a number of these phosphatases and even for several clas- 
sical and DS-PTPs, it is uncertain as to whether they possess 
innate or inducible en zym atic activity [4]. Other phosphatases 
such as PTEN (phosphatase and tensin homologue deleted 
on chromosome ten) and the MTM ramify members have 
phospholipids as substrates [5]- Interestingly, the number of 
classical and DS-PTPs predicted from the genome is nor dra- 
matically different from an earlier survey based on cDNA and 
EST (expressed sequence tag) data [21. A very similar observa- 
tion was made for another drug target class [?oi]. From a drug 
target discovery point of view, the availability of the human 
genome sequence does not appear to provide many new 
short-term opportunities. The accumulation of PTP ramify 
member sequences has failed to keep pace with their func- 
tional understanding and the authors believe that the PTPs 
arc undcrcxploited as drug targets. 

Consistent with a specific role for PTPs in vertebrate growth 
factor and cytokine signalling, almost no PTPs are found in 
bacteria (except in pathogenic species, as discussed below). A 
recent survey of the Arabidopsis tkaUana plant genome 
reported only a single PTP out of 1 12 phosphatase genes f6). 

3. Protein tyrosine phosphatases in signalling 

Figure 1 represents a simplified view of signalling by receptor 
kinases (green) and PTPs (red, with catalytic domains repre- 
sented as rectangles). Typically, cytokine and growth factor 
receptor signalling is initiated by ligand-induced dimcrisa- 
rion, which activates die receptors kinase domain. Alterna- 
tively, receptors may recruit intracellular kinases such as Jaks 
(Janus-activated kinases) that phosphoryiate the receptor in 
addition to themselves. Both events trigger phosphotyiation 
cascades. It is increasingly clear that multiple negative feed- 
back mechanisms exist that modulate these pathways includ- 
ing: receptor internalisation and degradation; the induction 
of SOCS (suppressors of cytokine signalling), which bind and 



mask receptor phosphotyrosincs; PIAS (protein inhibitors of 
activated STATs [signal transducers and activators of tran- 
scription]); and dephosphoryiation of cascade components by 
PTPs (see [7] for a review of cytokine signalling suppressors)* 
Since PTPs can exist as cither soluble protein or membrane- 
bound 'receptor PTPs associated with the endoplasmic retic- 
ulum, dephosphoryktion can take place while the receptor is 
membrane-anchored or while it is being recycled [8]. 

These feedback mechanisms are likely to be complex and 
subtle. For instance, SOCS are transcriptionally activated by 
cytokines; the PTP, SHP-1 (Src-homology-2 domain-contain- 
ing phosphatasc-1), is pctit-transkrionally activated by phos- 
phorylation and can then dock via its Src-homology-2 (SH-2) 
domain on receptor phosphotyrosines. PTPs and their poten- 
tial receptor substrates have recently been reviewed foioi. A 
number of PTPs have also been shown to regulate second 
messenger phosphoproteins. A number of these intracellular 
targets are listed in Table 2, 

4. Protein tyrosine phosphatase drug targets 

In this section the authors review PTPs that may be consid- 
ered as promising drug targets. These include PTPs that arc 
already widely recognised as such by the industry: FTP IB, 
CD45 and Cdc25B. However, the authors also include a 
number of more speculative targets; the choke for this second 
set has an inherent clement of subjectivity. 

4.1 PTP1B: a paradigm for protein tyrosine 
phosphatases as 'druggable' targets 

Since the end of the nineteenth century, vanadate and its 
derivatives have been known to have therapeutic utility in 
diabetes [13. By 1989, it was clear that this effect was medi- 
ated by the metal's inhibition of a phosphatase, which might 
critically regulate the insulin receptors phosphorylation state 
[5i>52]. Although PTPlB was one of the first PTPs to be 
cloned, ics use as a drug target was only appreciated when it 
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fioum 1 Schematic view of raptor tyrosine kinases and FTPs in signalling, Green: Receptor ^Orange: Receptor- and non- 
1 2S£^^ Angles; Blue: Receptor Knase ligand; Pink: (Put*ive) receptor FTP hgand. 

L Ligand; PTP: Protein tyrosine phosphatase; RTK: Receptor tyrosine kinase. 



was discovered that PTPlB knockout mice display reduced 
plasma glucose and insulin levels, obesity-rcsistancc and 
increased insulin receptor and IRS-l (insulin receptor sub- 
strate- 1) tyrosine phosphorylation [53,54]. These findings have 
incited many companies to start PTPlB drug screening 
projects. The development of an early candidate (Erripro- 
taflb®, Wjrcth) was discontinued following a diabetes Phase 
II trial because of efficacy and side effect issues. However, the 
prospects for PTPlB-targeting treatments are still bri^it, 
especially following impressive results from once-weekly 
injections of antisensc reagents by Isis Pharmaceuticals [119], 
currendy licensed to Merck; PTPlB is considered 'an out- 
standing target* by experts for these disease areas (B Kahn, as 
cited in (56]). 

While PTPlB is widely recognised as a drug target, its 
precise mode of action is not entirely clear. The enzyme has 
good substrate specificity for the autophosphoryhted insu- 
lin receptor 157], but recendy an additional, inhibitory role 
in leptinR signalling was reported, presumably via Jak-2 
dephosphorylation M. These recent findings help explain 
certain aspects of the knockout mice, such as their 
increased energy expenditure and obesity rcsiscance,How- 
ever, attributing the relative contributions of these two 
pathways to die PTPlB knockout phenotype is comply 
cated by the fact that leptin and insulin have overlapping 
functions. Insulin receptors in the brain arc involved in the 
control of obesity and appetite, and enhanced signalling 
through the leptin receptor reduces adiposity, which results 
in improved glucose tolerance. 



4.2 Protein tyrosine phosphatases in the immune system 

4.2.7 Protein tyrosine phosphatases that control 
antigen receptor signalling 

The role of PTPs in the immune system has recently been 
reviewed [90-95]. One of the earliest PTPs to be discovered, 
CD45, is essential for T and B lymphocyte antigen receptor 
signalling [59,601. This PTP would therefore appear to be a 
good target for immimosuppressors. The SH-2-domain-con- 
taining PTP, SHP-1 (not to be confused with SHIP [SH-2- 
domain-containing inositol 5-phosphatasc], a phospholipid 
phosphatase), is also involved in T-cell receptor (TCR) and 
B cell signalling* but its knockout phenotype [83-45] suggests it 
plays a much wider role, which perhaps makes it less suitable 
as a drug target. 

*2L2 Protein tyrosine phosphatases in infectious 
diseases 

There are many interesting examples in which pathogens 
exploit PTPs lo increase their survival. The earliest known 
example is the Yersinia bacterium, which encodes a PTP 
(YopH) rhat is essential for virulence in vivo 196.971- Helico- 
bacter pylori, a common, infectious bacterium and a causative 
agent of gastric ulcers and stomach cancer, is known to trans- 
duce the CagA protein into the gastric epidermal cells on 
which it thrives. Recently, it was found that one of CagAs 
roles is to activate SHP-2 (Srohomology-2 domain-contain- 
ing phosphatasc-2) [98]. Another example is Salmonella, which 
is known to transduce a PTP (SptP) into its host cells [99]. 
Other bacteria (Mycobacteria, Salmonella) arc also suspected to 
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Table 2. PTPs associated with intracellular phosphoprotein cascades. 



Substrate 


Associated FTP 


Method used 


Ref. 


p130<* 


PTP-PE5T 

FTP-cc 

LAR 

PTP1B 

SHP-2 

PTP-SAP 


Trapping mutants ana overeApression 
Overexpres$ion of FTP 

Overexpression of wt and mutant constructs 
Isolation from immunoprecip'rtation 
Overexpression 


[13] 

[14JS] 

(18) 
(191 


STAT-1 


SHP-1 
SHP-2 

PTP-p 
r i rt 


Knockout and overaqpression 
Direct or indirect mutagenesis 
Overexpression and mutagenesis 


120,21] 

1221 

123) 




SHP-1 
FTP-e 


Hirprr or Indirect muta denes is 
Overexpression 

Overexpression and mutagenesis 


[221 
[211 
(241 


STAT-5a/b 


SHP-2 
FTP1B 
TC-PTP 


Substrate trapping assay, phosphatase assay 
Overexpression and substrate trapping 
Overexpression and substrate trapping 


[25,26! 
[27] 


STAT-6 


SHP-1 


Transgenic expression 


[291 


p70zap 


C045 
HPC-PTP 


Knockout in cells and in vitro assay 
Transgenic expression 


[30] 
[31] 


p52Shc 


TC-PTP 


Trapping mutants 


[32] 


c-Abl 


PTP-PEST 


Knockout cells 


[331 


Jak-2 


CD-45 
FTP1B 


Knockout in cells and in vitro assay 
Overexpression, knockout and trapping mutant 


[341 
.[35] 


IR5-1/2 


PTP1B 
LAR 


Trapping mutants and in vitro assay 
Transgenic overexpression 


. 


Src-famify kinases 


PTP-a 

C045 

cPTP-X. r STEP 

PTP-01 

SHP-2 

HPC-PTP 

PTP1B 

PTP-BL 


Overexpression 

Overexpression and cellular knockout 
In vitro and in vivo assays 
Overexpression 

Overexpression and in vitro assay 
Cellular assay 

Cellular knockout and in vitro assay 
in vitro assay, negative regulator 


[38-401 

[41,421 

(43,44] 

[45] 

[46] 

[37) 

[47.48] 

[49] 


potenin 


PTP^(mRPTP^) 


in vitro and ligandnnduced inactivation 


[SO]' 



Far receptor substrates see [9J. 

HPC-PTP: Homotague at PC 12 cell PTP; IRS: insulin receptor substrate; J3fc Janus-activated kinase; LAR: Leucocyte amigen-rdated PTP; PTP: Protein Tyrosine 
phosphatase; PTP-PEST: PEST domain-containing PTP; PTP-5AP: Stomach cancer-associated-PTp; SHP: $rc-hgmology-2 domain-containing phosphatase; Src Sarcoma 
rous virus oncogene; STAT: Signal transducer and activator of transcription; STEP: 5triatum-enriched phosphatase; TC-PTP: T ccll-PTP; wt: Wlldtype. 



manipulate their hosts with PTFfc [98]. It is by no means clear 
if these intracellular or PTP-transducing microorganisms tar- 
get the same hose pathway(s) but such PTPs would a priori 
seem to be good drug targets. 

More indirect evidence of pathogens using PTPs comes from 
the observation that an established drug for the treatment of 
leishmaniasis, sodium sti bogluconate, is in fact a PIT inhibitor 
with selectivity towards SHP-1 and, to a lesser extent* SHP-2 
(i00). Thus, SHP*1, SHP-2 and these microbial PTPs appear to 
be eflcctive targets in the trearmenc of infectious diseases. 

43 Vascular leakage 

Vascular endothelial monolayers play an important role in the 
control of inflammation. Local inflammation involves cytokine- 
induced uprcgulation of endothelial adhesion molecules such as 



L~ and E-sdecrins and enhanced permeability of tight junc- 
tions, followed by neutrophil and macrophage extravasation. It 
was recendy shown that angiopoictin-1 and its endothelial 
receptor, Tie-2, antagonise this process, i.e., they restore blood 
vessel integrity from leakiness and play an anti-tnflamrnatory 
role [101 J. It was also shown that cndothclial-specific PTP- (5 (or 
VE-PTP [vascular endomelial-PTP] for the murine orthologue) 
specifically dcphosphorylatcs activated Tic-2 receptor kinase 
in vitro and in vivo [102], This would suggest thai PTP' (3 is a 
drug target in inflammation, with inhibitors blocking neu^ 
crophil and macrophage extravasation. 

Finally; genetic ablation of the PTP-£ gene results in mac- 
rophage abnormalities, notably in response to lipopolysaccha- 
ride [103]. This would suggest that PT P-E is a potential target 
in septic shock. 
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"fcble 3. PTP knockout mice and human FTP mutation phenotypes. 
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CD45 ' "* Uck T cells, immature B cells* 

Cdc25b Meiotic arrest* 1 

Cdc25c No obvious phenotype* 

DEP/CD148/PTPp2 Die at mid-gestation with severe defects in vascular .organisation'; frequently 
deleted in human cancers* 

tPMA2/Lafora Progressive myoclonus epilepsy* 

GLEPfM Reduced renal filtration surface area* 

Mammary gland defect reduced plasma glucose and insulin levels* 

LC-PTP (He-PTP) No obvious phenotype* 

No obvious phenotype* 

ly/n^-i Human mutations: severe hypotonia and generalised musde weakness* 2 

MTmr.2 Human mutations: Chaitol-Mane-Tooth^pe4B disease (CMT4B), progressive 

de-myelination* 

PE2/PTP36 Androgenisation of female mice, no mammary gland tissue* 

PTen Early embryonic lethality: frequently mutated in human cancers* 

PTP1 3 Enhanced leptin and insulin sensitivity* 

PTp-pest Die at embyronal stage day 8.5, defective neural fold dosure, lack primary 

hepatocytes* 
PTP-a No obvious phenotype* 

P7P-5 Growth retardation, early mortality, posture and motor defects 1 

prp-e Hypomydination* 
pjp-IA-2 Abnormal glucos^stimulated insulin secretion* 

PTp-k No obvious phenotype* 

PTP- U No obvious phenotype* 

H?^ Pituitary displasia, defects in olfactory lobes, reduction in CNS size and cell 

number* 

P7P-£ 'Suggestion of a fragility of myelin'* 

• sHP-1 Moth-eaten (me) mice lethal Haemopoietic dysregulation, splenomegaly, 

runting, autoimmune disease* 
5HP-2 Lethal at embryonal stage day 8,5 - 10, defect in mesoderm patterning* 

TC-PTP Born normal, die 3 - 5 weeks postpartum; defects in both haematopoiesis and 

immune function* __ 
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GifPP- Glomerular eprtheJla! protein; lAR: Ley oocyte antigen-related FTP; MKP: MAP kinase phosphatase; MTM Myotubularin; PTEN: Phosphatase andteran 
homoiogue deleted on chromosome tun; PTP: Protein tyrosine phosphatase: PTP-PEST: PEST domairxomaining PTT> SHP: Src-homOl09y-2 domamHCorrtanlng 
phosphatase: TC-PTP: Toell-PTP. 



4.4 Protein tyrosine phosphatase targets in cancer 

447 Few protein tyrosine phosphatases are tumour 
suppressors 

It is remarkable, given the large number of kinases whose over- 
production or increased activity is associated with cancer, that 
few PTP rumour suppressors are known. An exception is 



DEP/PTPRrJ (demity-enhanced PTP), whose mutations were 
recently found to be associated with colon cancer [63], Another 
recent study looked at expression of 68 phosphatase genes in 
ovarian cancer and found a significant correlation between 
repression of MKP- 1 (MAP kinase phosphatases 1, a DS-PTP) 
and malignancy [104], However, no MKP-1 murarions have 
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(yct) been associated with disease PTEN, a major tumour 
suppressor thai indirectly inactivates phcsphatidyiinositol-3 
(PI3) kinase [i05] is a phospholipid phosphatase rather than a 
PTE To the authors' knowledge, none of the 26 or so 
PTP knockout mice that have been described so far show a 
tendency to develop neoplasia (Table 3). It is possible that 
Mother Nature has hedged her bets by providing for redun- 
dancy in die dephosphorylation control of critical growth fac- 
tor-stimulated kinase cascades. By contrast, some PTPs that 
regulate Src-lrinases or Cdcs may be drug targets in cancer. 

AA2 Protein tyrosine phosphatases that activate 
Src-family kinases 

The activity of Src (sarcoma rous virus oncogene) and other 
Srt-ramiry kinases is controlled by phosphorylation of a- C- ter- 
minal tyrosine residue. Dc-phosphorylaaon of these kinases at 
this critical residue by a FTP disrupts an intramolcajbr inter- 
action with Src's ^terminal SH-2 domain. A large number of 
PTPs have been postulated to dephosphorylacc and activate 
protc-oncogene Src-farfliry kinases (Tabic 2). It is very doubtful 
mat inhibition of each of these PTPs would block Src overac- 
tivity. However, mice lacking PTPkx have reduced Src and fyn 
activity 175), Similarly, PTP-e knockout mice have reduced 
steady-state levels of activated Src, which suggests that PTP-£ 
plays a role in dephosphorylating and activating e-Src, When 
these knockout animals arc mated with Neu oncogene-overex- 
pressing animals they produce transgenic offspring with fewer 
mammary gland tumours than Neu mice that are not mutated 
in PTP-S, which suggests that in these animals Src family 
oncogenes cannot readily cooperate with the Neu oncogene 
to produce malignancies (10G]. A complication is that in many 
narruaUy-oceurting cancers, the C-terminal autoinhibitory 
tyrosine of Src is simply deleted or mutated, which frees the 
oncogene product from PTP control. 

4.43 Cydin-dependent kinase-activating protein 
tyrosine phosphatases 

Cdc25s are a family of three PTPs that activate cydin- 
dependent kinases [107]- Among these, Cdc25B is over ex- 
pressed in a high proportion (> 30%) of primary breast 
tumours, Cdc25B is being explored by a number of compa- 
nies as a drug target and inhibitors have shown efficacy in 
tumour models [108]- 

4.4AA protein tyrosine phosphatase that is specifically 
required for metastasis formation? 
It was recendy discovered that Prl-3 is specifically amplified 
and overexpressed in metastases but not in their o riginatin g 
primary tumours [i09]. This elegant work is strongly sugges- 
tive of Prl-3 being essential for the formation of metastases 
but perhaps not for normal tissue growth. However, it 
remains to be established as to whether Prl-3 activity is 
merely required for the establishment of metastases or for 
their survival also. Only in the latter case would a drug be 
highly effective. 



Table 4. PTPs for which structural information is 
available [55]. 



PTP 


Number of 


Species 


PTP1B 


29 


Li. irn^fS 

numan 


TC-FT? 


1 


Human 


SHP-1 


2 


Human 


SHP-2 


1 


nurnan 


unr DTD 

HrC-PTr 




Mai ka 


PTEN 


1 


Human 


PTP-a 




Mouse 


PTP-u 


; 


Human 


LAR 




Human 


Yop51 




Yersinia 


SptP 


2 


Salmonella 


VHR 


2 


Human 


Kap 


2 


Human 


MKP3/Pyst1, 


1 


Human 


Cdc25A 


1 


Human 


Cdc25B 


4 


Human 


LMW-PTP 


8 


Human, bovine, 
Saccharomyces 
cOTvfc/ae 



LAR: Leucocyte antigen-related ftp; LMW-PTP; Low molecular welght-PTP; 
PTEN: Phosphatase and tensin homologue deleted an chromosome 10; 
PTP: Protein tyrosine phosphatase: SHP: Src-homoiogy-2 domain-containmg 
phosphatase; TC-HP: T cell PTP: VHR: Vaccinia Hi -related protein. 



A3 Osteoporosis 

Experiments involving antisense PTP-0% have implicated 
GX-EPP (glomerular epithelial protein)- 1 (its human ortho- 
logue) as a negative regulator of osteoclastic resorption [110]. 
These results suggest that blocking GLEPP-1 may have a use- 
ful anabolic effect in osteoporosis. Another study observed 
chat alendronate (ALN), an aminobisphosphonate used in the 
treatment of osteoporosis, is in fact an inhibitor of PTP-e, a 
PTP that is highly expressed in osteoclasts [W]< The authors 
of this study speculate therefore that PTP-e may be a useful 
target for the treatment of osteoporosis. 

5. Conclusion and expert opinion 

5.1 Protein tyrosine phosphatase target discovery 
strategies 

The PTPs that have been reviewed here differ in how well 
they are established as targets. Some PTPs, like PTP IB, arc 
well accepted as targets, and PTP IB is now the focus of 
Inhibitor discovery at approximately 20 companies 156], 
whereas for others their 'dniggabilicy is more speculative. 
Validating a PTP drug target is not a trivial task. Much more 
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so than for other signalling component^ predicting the 
effects of PT? inhibition from overexprcssion or substrate 
specificity studies alone has not been very successful. The 
history of PTPlB's discovery as a target dearly illustrates 
that purely deterministic approaches alone arc not appropri- 
ate for PTPs. Today, the exact substrate of PTPlB is not 
known (The insulin receptor? Jak-2? The leptin 

receptor?), but there is very good evidence that PTPlB is a 
good drug target. It would not be wise to count on the next 
PTP target to manifest its 'targctability* as clearly as PTPlB. 
An effective* systematic PTP target approach would instead 
focus on knockout animals and antisense or other inhibitors 
as tested in disease models. Expression patterns and the 
molecular biology of PTPs may provide hints as to disease 
situations in which these reagents and animals should be 
tested. The recent revolution in siKNL/Vbased technologies 
that can be applied in mammalian systems may be very help- 
ful in this respect [U2-1171- 

5.2 A dilemma in new protein tyrosine phosphatase 
drug target discovery 

The authors' current understanding of cell biology in general 
and of PTPs in particular is, by objective standards, rudimen- 
tary. Therefore, they believe that at present a 'knock them out 
and see what happens 9 approach is more effective than 'dissec* 
tive' or 'dercrministic procedures, no matter how elegant or 
awe-inspiring such latter experiments may be. Unfortunately, 
'direct target validation strategics lack academic and scientific 
appeal. In addition, the drug industry is not sufficiently 
investing in efficient, systematic target discovery, because me 
acquisition of intcllectuai property for a drug target presently 
provides an inadequate return of investment. The financial 
rewards instead come from marketable inhibitors that are sub- 
sequently developed against that target (for instance by an 
industrial competitor). This situation may well be partly 
responsible for the paudty of new drugs against new targets 



that enter the market, and it is unlikely that further company 
mergers are the solution. 

S3 Are protein tyrosine phosphatases tractable 
targets? 

Discovering specific PTP inhibitors is a challenge. For 
instance, few of the PTPlB inhibitors reported so far in the 
literature and in patents show impressive selectivity over 
TC-PTP (T cell-PTP), its nearest homologue, which has very 
similar substrate sdectrvity to PTPlB [571. This is worrisome 
because TC-PTP knockout mice die soon after birth [89), On 
the other hand, heterozygous TC-PTP knockout animals are 
fertile and normal in all respects, which would suggest mat 
some degiee of TC-PTP inhibition may be harmless. Also, 
vanadate~bascd compounds, which inhibit virtually all PTPs 
(and hit quite a few non-PTP targets as well), have been tested 
in small-scale human trials with only very modest side effects. 
This suggests that a PTPlB inhibitor need not have absolute 
sdectrvity. In the longer term, other PTPs arc likely to benefit 
from the massive screening efforts on PTPlB, through the 
discovery of new classes of inhibitors [118] and the generation 
of detailed strucrural protein information (Table 4). The first 
PTP cDNAs were cloned in the early nineties, some 3 decades 
after kinases had been discovered and understood to be impli- 
cated in cancers. Yet, the first useful kinase inhibitor drug 
iniatinib mesylate (Gleevec™, Novartis) only entered the 
market in 2001. A comparison with the kinase timeline sug- 
gests that it may be simply too early to tell if and when 
PTP targeting drugs will be ready for the clinic. 
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The druggable genome 



Andrew L. Hopkins and Colin R. Groom 



An assessment of the number of molecular 
targets that represent an opportunity for 
therapeutic intervention is crucial to the 
development of post-genomic research 
strategies within the pharmaceutical 
industry. Now that we know the size of the 
human genome, it is interesting to consider 
just how many molecular targets this 
opportunity represents. We start from the 
position that we understand the properties 
that are required for a good drug, and 
therefore must be able to understand what 
makes a good drug target. 

Biological systems contain only four types of 
macromolecule with which we can interfere 
using small-molecule therapeutic agents: pro- 
teins, polysaccharides, lipids and nucleic acids. 
Toxicity, specificity and the inability to obtain 
potent compounds against the latter three 
types means that the vast majority of success- 
ful drugs achieve their activity by binding to, 
and modifying the activity of, a protein. This 
limits the molecular targets for which com- 
mercially viable compounds can be developed, 



leading to the concept of 'the druggable 
genome' — the subset of the -30,000 genes in 
the human genome that express proteins able 
to bind drug-like molecules. 

One way of assessing the opportunities 
available to the pharmaceutical industry is to 
begin by studying the properties that are 
required in a commercially viable drug. For the 
most part, this means an orally bioavailable 
compound. The physico-chemical properties 
that are necessary to increase the likelihood 
of oral bioavailability have been formalized 
into the 'rule-of-five' 1 (BOX l). Constraints 
such as these dictate the type of protein we 
see as drug targets — simply put, drug tar- 
gets need to be able to bind compounds with 
appropriate properties. 

Druggable protein families 

The druggable subset of the human genome 
can be predicted using several methods. In a 
comprehensive review of the accumulated 
portfolio of the pharmaceutical industry, 
Drews 2,3 identified 483 targets, and concluded 
that there could be 5,000-10,000 potential 



Box 1 | Guidelines for oral bioavailability: the 'rule-of-five' 

The 'rule-of-five' analysis by Upinski et aV shows that poor absorption or permeation of a 
compound are more likely when: there are more than five hydrogen-bond donors; ihe molecular 
mass is more than 500 Da; the lipophilicity is high (expressed as cLogP > 5); and the sum of 
nitrogen and oxygen atoms is more than 10. These rules, more appropriately described as 
guidelines, do not cover drugs that are derived from natural products, for which other 
absorption mechanisms are involved. 

Clearly, published data on the oral bioavailability of existing drugs could be used as a method 
for defining the properties of viable drugs; however, our approach using the rule-of-five allows 
predictions to be made. In practice, the number of targets identified by applying the rule-of-five 
filters differs little from that obtained solely by literature analysis of all known drugs, whether 
rule-of-five compliant or not. 



targets on the basis of an estimate of the 
number of disease-related genes 4 . However, 
this analysis did not focus on the properties 
of the drugs that define those targets. The 
idea of assessing the number of ligand- 
binding domains has also recently been 
introduced as a measure of the number of 
potential points at which small-molecule 
therapeutic agents could act — suggestions 
are that this figure could be even greater than 

10,000 (REF.5). 

Binding sites on proteins usually exist 
out of functional necessity; therefore, most 
successful drugs achieve their activity by 
competing for a binding site on a protein 
with an endogenous small molecule. For a 
drug to be effective, it must bind to its mol- 
ecular target with a reasonable degree of 
potency. Our analysis of the Investiga tional 
Drugs Database (produced by Current 
Drugs) and the Pharmaprojects Database 
(produced by PIB Publications), in addition 
to a thorough review of the literature, iden- 
tifies 399 non-redundant molecular targets 
that have been shown to bind rule-of-five- 
compliant compounds with binding affini- 
ties below 10 uJvl. 

Although there is some degree of overlap 
with earlier work 2 " 4 , we have captured several 
proteins that are targeted by experimental 
drugs, and eliminated some targets for which 
activity has not yet been shown to be modu- 
lated by rule-of- five-compliant compounds. 
Most of the drugs and leads that were identi- 
fied in this survey are competitive with an 
endogenous ligand at a structurally defined 
binding site. 

We have taken the sequences of the drug- 
binding domains of these proteins and deter- 
mined the families that they represent, as 
captured by their InterPro domain 6,7 . Only 
130 protein families represent the known drug 
targets (online table i ). Nearly half of the tar- 
gets fall into just six gene families: G-protein- 
coupled receptors (GPCRs), serine/threonine 
and tyrosine protein kinases, zinc metallo- 
peptidases, serine proteases, nuclear hormone 
receptors and phosphodiesterases (FIG. la). 
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Table 1 | Comparison of the draggable genomes of selected eukaryotes 








Homo 


Drosophila 


Caenorfiabditis 


Saccharomyces 




sapiens 


melanogaster 


elegans 


cerevisiae 


Total number of predicted genes 6 ' 9,16 


-30,000 


13,601 


18,424 


6,241 


Number of proteins in proteome* 


21,688 ' 


13,849 


17,946 


6,127 _ 


Number of estimated druggable targets 


3,051 


1,714 


2,267 


508 


Percentage that are predicted druggable targets 


-10-14% 


12%" " 


12% 


8% 



Three hundred and seventy-six targets identified to bind rule-of-five-compliant drugs have had InterPro domains assigned. The prevalence of these InterPro domains in 
various genomes has then been determined. Twenty-three more bacteriat and viral drug targets for which InterPro assignments could not be made have not been 
included in any of our analyses. # Data taken from InterPro, 29 October 2001 . 



The sequence and functional similarities were able to bind a drug, other members a protein with some precedent for binding a 



within a gene family are usually indicative of a 
general conservation of binding-site architec- 
ture between family members. This would 
suggest that if one member of a gene family 



would also be able to bind a compound with 
similar physico-chemical properties. Using this 
reasoning, 3,051 of the predicted 30,000 or 
so genes in the human genome 8,9 code for 




drug-like molecule (fig. lb, online TABLE i). 
A comparative analysis of the eukaryotic 
genomes of worm, fly and yeast also reveals 
that approximately one in ten of the proteins 
expressed by these genomes belong to gene 
families with members that have previously 
shown modulation by small -molecule drugs . 

(TABLE I)- 



Other 1 19 gene 
families and singleton 
targets 44% 



Cys proteases 2% 
Gated ion-channel 2% 
CYP enzymes 2% 
Cation channels 2% 




Other 114 gene 
families and singleton 
targets 40% 



Short -chain 
dehydrogenases/ 
reductases 2% 

-/-carboxylases 2% 
NHRs 2% 
CYP enzymes 2% 



PDEs 3% 



GPCRs 25% 



ST/Y kinases 10% 

Zn peptidases 4% 
NHRs 3% 

Ser proteases 
(trypsin) 3% 




ST/Y kinases 22% 



GPCRs 15% 



Cation channels 5% 



Ser proteases 
(trypsin) 4% 

Protein 

phosphatases 4% 



Figure 1 | Drug-target families. Gene-family distribution of a | the molecular targets of current rule-of- 
five-compliant experimental and marketed drugs, and b | the druggable genome. Serine (Ser)/threonine 
and tyrosine protein kinases are grouped as one gene family (ST/Y kinases), as are class 1 and class 2 
G-protein-coupled receptors (GPCRs). CYP, cytochrome P450; Cys, cysteine; NHR, nuclear hormone 
receptor; PDE, phosphodiesterase; Zn, zinc. 



Expanding the druggable genome 

At present, approximately half of the proteins 
expressed by the genome are functionally 
unclassified, and of course, some of these 
might prove to be druggable. However, it is 
clear from the distribution of the gene-family 
populations that there are no undiscovered 
large protein families, which indicates that 
remaining targets will be members of very 
small families. Clearly, the number of potential 
protein targets could be larger than the number 
of genes, owing to post-translational modifica- 
tions and assembly of functional complexes; 
however, this is not likely to increase the num- 
ber of specific drug-binding sites. 

Further evidence for this can be drawn 
from the observation that despite numerous 
screening attempts, many targets have failed to 
show any evidence of binding compounds 
that are potent and 'drug-like'. This might be a 
function of the chemical diversity of corporate 
compound files. However, if druggability is an 
inherent property of the protein, then a priori 
assessment criteria of potential targets to 
assess the likelihood of developing a drug 
against a particular site can be developed. As 
most drugs bind to discrete binding sites, 
which can be identified readily by structural 
analysis, it is possible to filter what we term 
'beautiful binding sites' from the wealth of 
protein -structure data that are available in the 
Protein Data Bank and are expected soon 
from structural genomics projects. 

As most drugs compete against small 
molecules for binding sites on proteins, the 
number of these binding sites is probably a 
function of the size of the metabolome (the 
total set of small molecules in an organism). 
One route to target discovery might therefore 
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Figure 2 | Number of drug targets. The effective number of exploitable drug targets can be 
determined by the intersection of the number of genes linked to disease and the 'druggable' subset 
of the human genome. 



lie in identifying enzymes and receptors from 
metabolomic profiling 10 " 12 . By contrast, the 
druggability of targets identified by pro- 
teomic or transcription-profiling studies is 
likely to be low. 

Draggable does not equal drag target 

The ability of a protein to bind a small mole- 
cule with the appropriate chemical properties 
at the required binding affinity might make it 



druggable, but does not necessarily make it a 
potential drug target, for that honour belongs 
only to proteins that are also linked to disease. 

Recent estimates propose that there are 
from 3,000 (REF. 13) to 10,000 (RER4) disease- 
related genes, and large-scale mouse-knockout 
studies have revealed that only - 10% of all 
gene knockouts might have the potential to 
be disease modifying 14 , which supports esti- 
mates at the lower end of this range. 
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Figure 3 | Novel drug launches. The graph shows the number of small-molecule, 'first -in -class' 
drugs and associated new drug targets that have been launched on the market in the past decade 
(data derived from collating annual This Year's Drugs' reviews of Drug News & Perspectives, Prous 
Science). NCE, new chemical entity. 



The potential drug targets that the pharma- 
ceutical industry can exploit are captured in 
the intersection between the druggable 
genome and those genes related to disease, as 
shown in PIG. 2. An analysis of the antifungal 
targets from the yeast genome indicates that 
the intersection might be as small as 2-5% of 
the genome (C.R.G. and J. E. Mills, unpub- 
lished observations) — extrapolating to 
man, this suggests a total of 600-1,500 small- 
molecule drug targets. 

Targets to market 

Despite the massive increases in research and 
development (R&D) investment over the past 
decade, and the advent of molecular biology, 
the rate at which drug targets are clinically vali- 
dated and brought to market is growing rather 
slowly. On average, new drugs are launched 
against only four novel targets each year (FIG. 3). 

The distribution of target types shown in 
FIG. u is similar to the distribution seen in 
the original work of Drews 2,3 , but to our sur- 
prise, of our set of 399 targets with known 
rule-of-five-compliant agents, we could 
identify only 120 proteins as the targets of 
drugs that are actually marketed. This small 
number of targets calls into question the 
common assumption that a large number of 
targets are necessary to build a successful 
industry 13 . Differentiation between drugs 
that bind to the same receptor could lead to 
the development of several distinct classes, 
targeting a range of diseases. 

The overall distribution of launched tar- 
gets by biochemical class is similar to that 
observed for all targets with drug-like leads 
(FIG. 4). Enzymes represent just under half of 
the launched targets (47%), whereas GPCRs 
account for 30%. All other classes, such as ion 
channels and nuclear hormone receptors, 
account for less than a quarter of the identi- 
fied launched targets. 

Implications 

Commercial pressure forces the pharmaceuti- 
cal industry to focus on developing orally 
bioavailable small molecules, limiting oppor- 
tunity to the number of binding sites for such 
molecules on proteins encoded by the genome. 
New mechanisms, such as protein drugs, 
antibody therapies, DNA vaccines and non- 
oral drug delivery systems, could expand the 
range of potential targets to those funda- 
mentally not tractable with rule-of-five- 
compliant therapies. 

A comparison of gene -family size with the 
number of targets in a family that have spe- 
cific leads shows that many large, druggable 
gene families are still under- exploited (FIG. 5). 
The application of high-throughput screening 
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in the pharmaceutical industry and the lim- 
ited number of druggable targets suggest 
that, within the next decade, the industry 
could reach a position in which 'hits' or chem- 
ical leads are available for most potentially 



druggable targets. The challenge for the 
industry will then not necessarily be in the 
discovery of leads, but in discovering and 
assessing the therapeutic utility of its leads 
and druggable targets. 



The limited number of small-molecule 
drug targets suggests that to exploit the 
opportunity of the druggable genome in a 
cost-effective manner, the next round of 
innovation for the pharmaceutical industry 
lies not necessarily just in the science, but also 
in the business models 15 . 
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Figure 5 1 Exploitation of the genome, as measured by known leads. In the past few decades, the 
pharmaceutical industry has assigned high priority to research into gene families, such as kinases, matrix 
metailoproteinases and cysteine proteases. Few drugs aimed at these gene families have yet reached the 
market, although many are progressing through development. Cys, cysteine; GPCR, G-protein-coupled 
receptor; Ser, serine; Thr, threonine; Tyr, tyrosine. 
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EXHIBIT O 

Arthur T. Sands, Current Drug Discovery, Aug. 2002, at 21. 



FEATURE I 

Industrializing breakthrough discovery 

Arthur T Sands 

Lexicon Genetics, USA 

Predicting drug action using mouse knockouts was pioneered by Lexicon; five years 
later the full potential of gene knockout technology is beginning to be realized. In 
combination with comprehensive physiological analysis, the technology delivers 
novel, in v/Vo-validated targets with the potential for the discovery of breakthrough 
therapeutics. 



Innovation in the pharmaceutical industry 
depends on breakthrough biological dis- 
coveries that reveal new targets for thera- 
peutic intervention. These new targets 
must provide potent new mechanisms of 
action to block disease by creating favor- 
able alterations in physiology without 
undesirable side effects. In vivo methods of 
target validation using gene knockouts 
have revealed truly rare and valuable 
targets. This fact stands in direct contrast 
to the popular myth that the human 
genome contains thousands of viable drug 
targets. It seems that a simple truth has 
been all but forgotten in the pursuit of 
ultra-high-throughput drug discovery: it is 
the quality of new targets not the quantity 
that holds promise for replenishing the 
pharmaceutical industry's drug discovery 
pipeline. 

There is plenty of evidence that more is 
not necessarily better. The pharmaceutical 
industry spends $30 billion each year on 
research and development - three times 
more than a decade ago - yet the number 
of new drugs coming to market has not 
increased. The industry's product innova- 
tion bottleneck is especially critical since 
$38.6 billion in brand name drugs will be 
coming off patent over the course of the 
next three years, creating a market void 
that drug makers are not prepared to fill. 

Physiology must guide discovery 

To replenish product pipelines, the indus- 
try is looking to biotechnology companies 
to accelerate the identification and valida- 
tion of new targets. In order to discover 
which genes among thousands encode 



breakthrough targets, industry scientists 
must conduct rigorous physiological 
assessments to determine which targets to 
eliminate and which to pursue. Only those 
targets that demonstrate the potential to 
maximize therapeutic effects and minimize 
side effects should be pursued, thereby 
reducing the failure rate and increasing the 
overall efficiency of the drug discovery 
process. 

Since a therapeutic alteration in 
physiology is the desired endpoint of drug 
discovery, overly reductionist approaches 
that ignore the complexity of mammalian 
physiology are inevitably doomed to 
failure. Computer modeling, DNA microar- 
rays, proteomics and lower model 
organisms cannot encompass the 
complexity of mammalian physiology and 
may actually distract researchers from a 
more productive pathway to discovery. 
Even human genetic studies may be prob- 
lematic, since they are more likely to reveal 
genes that cause disease rather than drug 
targets for future cures. 

Just as drugs must act within the 
context of physiology, novel drug targets 
must be validated within the context of 
mammalian physiology before precious 
resources are expended to develop drugs. 
Grounding genomics in the discipline of 
physiology can increase success rates, 
enhance product pipelines and create safer 
and superior therapeutics, as well as 
reduce the enormous amount of time and 
capital expended for the discovery and 
development of a drug. Those companies 
who are equipped to rapidly and effectively 
integrate physiological information into the 



target selection process will dominate the 
next generation of successful drug 
discoveries. 

Of knockout mice and men 

After a decade of using mouse knockouts, 
the data on their predictive power in drug 
discovery is irrefutable. The top 100 
selling drugs in 2001 are directed only to 
29 drug targets, many with multiple 
agents addressing the same target. Of 
these 29 targets, 23 have been knocked 
out and in every case the knockout mouse 
was highly predictive as to the on-target 
effects and side effects of the associated 
drugs. These observations lay to rest early 
theoretical concerns regarding the reliabil- 
ity of the mouse knockout technology to 
recapitulate actions of drugs in mam- 
malian model systems. The recent near 
completion of the genomic sequence of 
mouse and man, now available through 
either public or private DNA sequence 
databases, has confirmed the high rate of 
genomic similarity between the two organ- 
isms. Indeed, many decades of research 
have proved the mouse to be an invaluable 
tool for the evaluation of biological 
processes relevant to human disease, 
including immunology, oncology, neurobi- 
ology, cardiovascular biology, obesity and 
many others. Well-established parallels 
exist between humans and mice on cellu- 
lar, biochemical and physiological levels. 

Industrializing discovery 

At Lexicon Genetics mouse knockouts are 
guiding researchers to discover new thera- 
peutic agents which represent the best 
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physiologic switches in the genome for the 
treatment of disease. This has required the 
industrialization of gene targeting, gene 
trapping and mouse 
embryonic stem cell 
technologies, as 
well as the build-up 
of significant scien- 
tific infrastructure. 
This infrastructure 
will allow the 
company to analyze 

5000 genes as mouse knockouts in its 
Genome5000 program over the next five 
years. Efforts are concentrated on the 
unknown function of known gene families 
for which therapeutic agents can be devel- 
oped through small molecule chemistry, 
antibody or therapeutic protein develop- 
ment. These gene families include G 
protein-coupled receptors, kinases, pro- 
teases, ion channels, secreted proteins, 
transporters and other key enzyme classes. 
Gene targeting by homologous recombina- 



tion combined with gene trapping maxi- 
mizes both selectivity and throughput for 
large-scale, in vivo target validation. 



"It seems that a simple truth has been all but forgotten in the pursuit of 
ultra-high-throughput drug discovery: it is the quality of new targets not the 
quantity that holds promise for replenishing the 
drug discovery pipeline." • 



The company has deployed a compre- 
hensive, in vivo analysis of candidate drug 
targets that has been modeled after clinical 
evaluation. Genes analyzed in this way are 
subject to a superior level of in vivo analy- 
sis, including physiological function and 
potential disease indication, providing a 
robust pipeline of high-value targets. This 
approach has already proved successful in 
extracting vital information about the 
potential medical utility of several new 
targets in atherosclerosis, diabetes, obesity 



and CNS disease, among others. Lexicon's 
physiological analysis utilizes a wide range 
of the latest medical technologies, includ- 
ing intensive ana- 
lytical procedures 
such as the CAT 
scan for organ 
system visualiza- 
tion, dual energy X- 
ray absorptiometry 
for measurement of 
percentage fat and 
lean body mass and bone mineral density, 
functional magnetic resonance imaging, 
which allows in vivo neurochemical and 
cardiac analysis, clinical blood and urine 
chemistries, complete blood cell counts, 
fluorescent-activated cell sorting, cell-cycle 
analysis and neurobehavioral testing. 
Histopathological and gene expression 
surveys of 55 tissues provide cellular and 
gene expression data for additional infor- 
mation. Disease challenge models may 
also be used when indicated to maintain a 




Computer modeling, DNA microarrays, proteomics and lower model organisms cannot encompass the complexity of mammalian physiology and may 
actually distract researchers from a more productive pathway to discovery. 
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high degree of sensitivity, enabling the 
detection of subtle phenotypes that may be 
of significant medical value. 

The phenotype derived from the knock- 
out of a specific gene reveals both the 
potential therapeutic value as well as other 
target-specific side effects that may be 
anticipated for a small molecule inhibitor 
of that target. 
For instance, a 
target may 
display thera- 
peutic potential 
in inflamma- 
tion, but might 
also be critical 
for renal func- 
tion. Without a mammalian knockout 
model, these deleterious target-specific 
side effects might not be observed until 
after significant amounts of time and 
resources have been spent on developing 
small-molecule compounds and testing 
them in preclinical or clinical development. 
When a drug produces a deleterious effect 
that was not observed in the knockout 
animal, it suggests that further optimiza- 
tion of the compound's specificity is worth- 
while. The ability to produce strong 
preclinical data to support efficacy and 
lack of deleterious side effects for a novel 
target and corresponding lead compound 
further legitimizes the value of a drug dis- 
covery program and provides confidence to 
move ahead aggressively in development. 

Predicting breakthrough therapeutics 

Gene knockouts can be viewed as model- 
ing the biological mechanism of drug 



action by presaging the activity of highly 
specific antagonists in vivo. This informa- 
tion is critical when making decisions 
regarding target prioritization for a drug 
discovery enterprise. Since knockout mice 
have been shown to model drug activity, 
they provide an unprecedented level of 
predictive power over the drug discovery 



process and can be extremely valuable to 
the pharmaceutical and biotechnology 
industries. With the effective use of mouse 
knockout technology, expensive drug dis- 
covery activities such as high-throughput 
screening, medicinal chemistry, preclinical 
research and clinical trials can be focused 
on the drug targets that are most likely to 
lead to breakthrough therapeutics. 

Hypothesis-driven gene targeting and 
gene trapping place physiology and thera- 
peutic potential at the forefront of the drug 



discovery process and will provide primary 
data on the physiological function of virtu- 
ally all members of 'drugable' gene families 
over the next few years. However, the full 
power of knockout mouse technology can 
only be realized when the predictive nature 
of knockout mouse phenotypes is applied 
early in the drug discovery process. The 
combination of mouse 
gene knockout technol- 
ogy and comprehensive 
physiological analysis 
will provide the phar- 
maceutical industry 
with novel, in v/Vo-vali- 
dated targets with clear 
potential for the discov- 
ery of breakthrough therapeutics. 

Arthur T Sands MD, PhD 

President & CEO 

Lexicon Genetics Inc 

8800 Technology Forest Place 

The Woodlands 

TX 77381-1160 

USA 

Email: asands@lexgen.com 
www.lexgen.com 
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teristic of type 2 diabetes that correlates with 
the increased expression of PTP-1B (2). PTP- 
1B directly interacts with the activated insulin 
receptor (5), and vanadate, a potent nonselec- 
tive PTP inhibitor, can function as an insulin 
mimetic both in vitro and in vivo {4). However, 
PTPs other than PTP- IB can also dephosphor- 
ylate the activated insulin receptor (5). To clar- 
ify the role of PTP- IB in insulin action, we 
generated mice in which the mouse homolog of 
PTP- IB was disrupted. 

The murine gene encoding PTP- IB was 
cloned from a 129/Sv mouse genomic library 
and shown to consist of at least nine exons 
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Protein tyrosine phosphatase-IB (PTP-1 B) has been implicated in the negative 
regulation of insulin signaling. Disruption of the mouse homolog of the gene 
encoding PTP-1B yielded healthy mice that, in the fed state, had blood glucose 
concentrations that were slightly lower and concentrations of circulating insu- 
lin that were one-half those of their PTP-1 B 4 "' 4 littermates. The enhanced 
insulin sensitivity of the PTP-1B _/ ~ mice was also evident in glucose and insulin 
tolerance tests. The PTP-1 B _/ ~ mice showed increased phosphorylation of the 
insulin receptor in liver and muscle tissue after insulin injection in comparison 
to PTP-1 B +/+ mice. On a high-fat diet, the PTP-1 B~'~ and PTP-1 B H / ~ mice were 
resistant to weight gain and remained insulin sensitive, whereas the PTP-1 B +/ + 
mice rapidly gained weight and became insulin resistant. These results dem- 
onstrate that PTP-1 B has a major role in modulating both insulin sensitivity and 
fuel metabolism, thereby establishing it as a potential therapeutic target in the 
treatment of type 2 diabetes and obesity. 
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spanning more than 20 kb (<5). A targeting 
vector was designed to delete a portion of the 
gene that included exon 5 and the tyrosine 
phosphatase active site in exon 6 and to 
replace it with the neomycin resistance gene. 
Two separate embryonic stem cell clones that 
had undergone homologous recombination 
and possessed a single integration event were 
used to microinject Balb/c blastocysts. Chi- 
meric males were mated with wild-type 
Balb/c females, and heterozygotes from this 
cross were mated to produce animals ho- 
mozygous for the PTP-1B mutation (Fig. 
1A). Immunoblot analysis of liver micro- 
somes revealed that PTP-1B protein was ab- 
sent in PTP-1B null mice, and heterozygotes 
expressed about half the amount of PTP-1B 
as did wild-type mice (Fig. IB). PTP-1B""'-, 
heterozygous, and wild-type littermates were 
born with the same appearance and with the 
expected mendelian ratio of 1:2:1. PTP- 
lB~ /_ mice grew normally, did not show any 
significant difference in weight gain as com- 
pared to wild-type mice, have lived longer 
than 1 .5 years without any sign of abnormal- 
ity, and are fertile. Complete necropsies were 



+/- -f~ +/+ +/- -f- 





PTP-1 B 



Fig. 1. Gene targeting of the PTP-1 B locus. (A) 
Representative genomic Southern blot analysis 
of tail DNA digested with Bam HI from a het- 
erozygous cross resulting in wild-type (+/+), 
heterozygous (+/-), and homozygous (-/-) 
PTP-1 B offspring. (B) PTP-1B immunoblot anal- 
ysis of liver membrane samples from PTP- 
1B +/+ PTP-1 B +/ ", and PTP"'- mice. 



done on male and female wild-type, het- 
erozygous, and homozygous PTP-1B mutant 
mice 7 to 8 weeks old, and no gross or 
histological (brain, liver, muscle, pancreas, 
and testes) differences were observed. 

If the role of PTP-1B in the insulin sig- 
naling pathway is to dephosphorylate the ac- 
tivated insulin receptor, then mice deficient 
in PTP-1B might have a sustained insulin 
response because the insulin receptor would 
remain phosphorylated and hence be activat- 
ed longer than in PTP-1 B +/+ mice. We mea- 
sured glucose and insulin concentrations in 
fasted and fed animals (7) (Fig. 2). In the fed 
state, the PTP-1 B~ /_ mice had a significant 
13% reduction in blood glucose concentra- 
tions, whereas the heterozygotes had an 8% 
reduction when compared to wild-type mice 
(Fig, 2 A). The PTP-1 B~'~ mice had circu- 
lating insulin concentrations that were about 
half those of control fed animals (Fig. 2B). 
Thus, PTP- IB-deficient mice appeared to be 
more insulin sensitive, because they main- 
tained lower glucose concentrations with sig- 
nificantly reduced amounts of insulin. In the 
fasted state, there were no significant differ- 



ences in concentrations of glucose or insulin. 
We examined insulin sensitivity in PTP- 



1B- 



PTP-1B" 



and PTP-1 B H 



with oral glucose and intraperitoneal insulin 
tolerance tests (#). Administration of a bolus 
of glucose to PTP-1 B w ~ mice resulted in a 
more rapid clearance of glucose than was 
observed in wild-type mice (Fig. 3A). There 
was a more pronounced hyperglycemia in the 
wild-type animals at all time points after 
glucose administration than in PTP-1 B-defi- 
cient mice. Increased insulin sensitivity was 
also observed upon injection of insulin (Fig. 
3B). Hypoglycemia was evident 30 and 60 
min after injection, but by 120 min after 
injection glucose concentrations were return- 
ing to normal values in wild-type mice, 
whereas the PTP-1 B~deficient mice re- 
mained hypoglycemic (P < 0.02). The PTP- 




Ffg. 2. Serum concentrations of glucose and 
insulin in animals fed ad libitum or fasted over- 
night. (A) Glucose and (B) insulin concentra- 
tions were determined as described (7). Dark 
bars indicate fed group [(A) and (B), n = 19 to 
21]; light bars indicate fasted group [(A), n = 8 
to 10; (B), n = 6]. Values are given as means ± 
SEM. Statistical analysis was done with a two- 
tailed, unpaired, Student's r test. Compared to 
the wild type, *P = 0.06 and **P < 0.01. 



Time (min) 



Fig. 3. Glucose and insulin tolerance tests in 
PTP-1B +/+ (diamonds), PTP-1B+'" (squares), 
and PTP-1 B~ /_ (triangles) mice. (A) Glucose 
tolerance of male mice 10 to 14 weeks old [n = 
11). (B) Insulin tolerance of male mice 10 to 14 
weeks old (n = 5 to 6). Data are presented as 
means ± SEM. Statistical analysis was done 
with a two-tailed, unpaired, Student's t test. 
Compared to the wild type, *P < 0.05 and **P 
< 0.02. 



Table 1. Fasting glucose, triglyceride, and insulin levels of male wild-type, heterozygote, and PT-1B 1 littermates fed a normal or high-fat diet. Values 
are given as the means ± SEM. Statistical analysis was done with a two-tailed, unpaired, Student's t test. ND, not determined. 



Diet 


PTP-1 B 




PTP-1 B +/ " 




PTP-1B"'- 




Normal 


High fat 


Normal 


High fat 


Normal 


High fat 


Glucose (mM) 
Triglycerides (mM) 
Insulin (ng/ml) 


6.1 ±0.3 
1.84 ± 0.76 
0.30 ± 0.02 


8.1 ±0.6* 
2.41 ± 0.19 
0.98 ± 0.32 


6.2 ±0.3 
1.43 ± 0.44 
ND 


7.3 ±0.6 
2.44 ± 0,32 
0.97 ± 0.30 


6.3 ±0.3 
0.86 ±0.18* 
0.33 ± 0.08 


7.0 ±0.4 
1.46 ±0.15* 
0.45 ±0.14* 



*P < 0.05 (n = 6 to 10). 
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1B* 7 ~ mice did not show altered glucose 
tolerance as compared to that in wild-type 
mice, although glucose concentration at 120 



140 




Fig. 4. Increased and prolonged tyrosine phospho- 
rylation of the insulin receptor in PTP-IB"'" 
mice. (A) Time course of tyrosine phosphoryl- 
ation of the insulin receptor (IR) p subunit in the 
liver after insulin treatment in PTP-1B +/+ and 
PTP-IB"'" mice. Quantification of the insulin 
receptor p subunit phosphotyrosine content from 
immunoblots was performed by densitometry. 
Data are presented by setting the amount of 
phosphorylation at 1 min to 100 and the subse- 
quent amount after 5 min for the same animal 
relative to this value. The results are from five 
PTP-IB - ' - and PTP-1B +/+ mice each, from three 
separate experiments. (B) Phosphorylation of the 
insulin receptor p subunit in muscle of insulin- 
treated PTP-1B +/+ and PTP-IB"'" mice. The 
quantified data from immunoblots (n = 6, from 
two separate experiments) are presented as arbi- 
trary units. (C) Insulin-stimulated tyrosine phos- 
phorylation of IRS-1 from muscle of PTP-1B + ' + 
and PTP-IB - '" mice 2 min after injection (n = 
3). The quantified data are presented as arbitrary 
units. Data are means ± SEM. Statistical analysis 
was done with a two-tailed, unpaired, Student's t 
test comparing in (A) the 5-min to the 1-min 
time point value and in (B) the PTP-1B"'" mice 
2- and 6-min time point values to the respective 
values of the PTP-1B + '+ mice (*P < 0.05). 



min after injection of insulin appeared to be 
intermediate between that of wild-type and 
PTP-1B" 7 " mice (Fig. 3). 

Binding of insulin to its receptor results in 
autophosphorylation of the receptor on ty- 
rosines ll 46, ll 50, and H5l in the kinase 
regulatory domain (9). This causes activation 
of the insulin receptor tyrosine kinase, which 
phosphorylates the various insulin receptor 
substrate (IRS) proteins that propagate the 
insulin signaling event (9). If PTP-IB dc- 
phosphorylates the activated insulin receptor, 
then the increased insulin sensitivity ob- 
served in the PTP-IB -7 " mice may be due to 
increased or prolonged phosphorylation of 
the receptor (or both). We therefore measured 
tyrosine phosphorylation of the insulin recep- 
tor in liver and muscle tissue (Fig. 4, A and 



70- 



Male 




Fig. 5. Resistance of PTP-1B null and heterozy- 
gous mice to diet-induced obesity. The percent 
weight gain of male and female wild-type (di- 
amonds), heterozygous (squares), and homozy- 
gous (triangles) littermates fed a high-fat diet 
for 10 weeks is shown. The starting weight 
(male: +/+, 27.6 ± 1.4 g; +/-, 28.5 ± 1.2 g; 
and -/-, 26.3 ± 1.2 g; female: +/+, 22.1 ± 
0.8 g; +/-, 22.2 ± 0.8 g; and -/-, 21.5 ± 0.8 g) 
and final weight (male: +/+, 41.4 ± 1.3 g; +/-, 
37.2 ± 2.0 g; and -/-, 33.5 ± 1.6 g; female: 
+ /+, 33.3 ± 1.7 g; +/-, 27.3 ± 1.3 g; and -/-, 
27.2 ± 1.4 g) of animals put on the high-fat 
diet are indicated. The final weight was signif- 
icantly different (P < 0.05) for PTP-1B null and 
heterozygous mice as compared to wild-type 
mice, except for male wild-type mice versus 
heterozygotes (P - 0.1). 



B) after exposure to insulin (JO). The kinetics 
of insulin receptor phosphorylation in the 
liver were significantly different in PTP- 
IB ~'~ mice than those observed in the PTP- 
IB 4 "' 4 * mice. The amount of insulin receptor 
phosphorylation was the same for both PTP- 
1B"'" and PTP-IB 4 "' 4 " animals 1 min after 
injection. However, by 5 min after injection, 
phosphorylation decreased to about 50% of 
maximal in the PTP-IB 4 "' 4 " animals, whereas 
no reduction was observed in the PTP-I B w ~ 
mice. A greater effect on insulin receptor 
phosphorylation was observed in muscle of 
the PTP-IB" 7 " mice. The absolute amount of 
receptor phosphorylation was increased about 
twofold in muscle from PTP-IB"'" mice as 
compared to that in muscle from PTP-IB 4 " 7 "*" 
animals (P < 0.05) (Fig. 4B). The amount of 
insulin receptor phosphorylation in muscle 
did not significantly change between 2 to 6 
min after insulin injection in either PTP- 
1B ~/~ or pjp.iB 4 -^ samples. No difference 
in the amount of insulin receptor expression 
was detected between PTP-IB 4 *' 4 " and PTP- 
1B~ 7 ~ mice, as determined by protein immu- 
noblotting (11). 

To confirm that the increased phosphoryl- 
ation of the insulin receptor in the muscle of 
insulin-treated PTP-IB" 7- mice reflects in- 
creased kinase activity, phosphorylation of 
IRS-1 was also measured (12) (Fig. 4C). 
Phosphorylation of IRS-1 was increased in 
insulin-treated muscle from PTP-IB" 7 " mice 
in comparison to muscle from wild-type an- 
imals (P < 0.05). We have also examined 
phosphorylation of another receptor tyrosine 
kinase, the epidermal growth factor receptor, 
and found no difference between PTP-IB 
wild-type and deficient animals (//). The 
increased insulin receptor phosphorylation in 
muscle and its sustained phosphorylation in 
the liver probably accounts for the enhanced 
insulin sensitivity observed in the PTP- 
1B~ 7 ~ mice. 

The disruption of the PTP-IB gene dem- 
onstrates that altering the activity of PTP-I B 
can modulate insulin signaling in vivo. To 
determine the effect of the loss of PTP-IB 
activity on insulin resistance, PTP-lB-defi- 
cient, wild-type, and heterozygote littermates 
were subjected to a diet high in fat (50% of 
calories from fat) and calorics (5286 kcal 
kg~'; Bioserve, NJ) {13) that normally results 
in obesity-induced insulin resistance (14). 
During the 10 weeks the mice were on this 
diet, male and female wild-type littermates 
rapidly gained weight, whereas PTP-IB" 7 " 
and PTP-IB 4 " 7 " mice were substantially pro- 
tected from diet-induced weight gain (Fig. 5). 
The amount of food consumed by the animals 
did not differ, which indicates that decreased 
expression of PTP-IB (heterozygotes have 
about half the amount of PTP-IB as that in 
wild-type animals) can influence dietary-in- 
duced obesity. 
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We examined the effect of the high-fat 
diet on insulin sensitivity in these animals. 
Glucose and insulin concentrations in the se- 
rum of fasting animals were measured, and 
glucose and insulin tolerance tests were done 
on all groups of animals [values for males are 
presented; females had essentially the same 
response (11)]. In wild-type mice, the high- 
fat diet produced a 30% increase (P < 0.05) 
in fasting blood glucose concentrations and a 
threefold increase in serum insulin concentra- 
tions (Table 1) as compared to wild-type 
mice on a normal diet. In contrast, the PTP- 
jq-/- animal maintained glucose and insu- 
lin concentrations while on the high-fat diet, 
which were not significantly different from 
those in animals on a normal diet (Table 1). 
PTP-1B heterozygotes on a high-fat diet 
showed increased fasting concentrations of 
circulating insulin but had fasting glucose 
concentrations that were not significantly dif- 
ferent from those in animals on a normal diet 
(Table 1). PTP-1B _/ " mice also had en- 
hanced insulin sensitivity as compared to 
their wild-type littermates in both glucose 
and insulin tolerance tests (Fig. 6, A and B), 
The difference in insulin sensitivity between 
the PTP-1B W ~ and PTP-1B +/+ mice be- 
came more augmented on the high-fat diet 
because of the obesity-induced insulin resis- 
tance of the wild-type mice. Obesity- induced 
insulin resistance results in a reduction in 
insulin receptor phosphorylation and hence in 
insulin signaling (14). Examination of insu- 
lin-stimulated receptor phosphorylation in 
mice on the high-fat diet revealed that there 
was a much greater difference in the amount 
of insulin receptor phosphorylation between 
the PTP-1B wild-type and deficient animals 
(Fig. 6C). This increased difference appears 
to be due to both a reduction in the amount of 
insulin-stimulated receptor phosphorylation 



in wild-type mice and a slight increase in the 
amount of insulin-stimulated receptor phos- 
phorylation in the PTP-IB" 7 "" mice. The 
PTP-1B heterozygotes on the high-fat diet 
also appeared to maintain a better response to 
insulin-stimulated receptor phosphorylation 
than did wild-type animals (Fig. 6). 

The reason for the obesity resistance ob- 
served in the PTP-1B~'~ mice is unclear at 
this time, but analysis of triglyceride concen- 
trations indicates that fat metabolism has 
been affected in these animals. The PTP- 
!B _/ ~ mice on either diet had significantly 
lower triglyceride concentrations than did 
wild-type and heterozygous mice (Table 1). 
We also examined insulin-stimulated recep- 
tor phosphorylation in adipose tissue and 
found no significant difference between wild- 
type and PTP-lB-deficient animals on either 
diet (11). Thus, PTP- IB-deficient mice ap- 
pear to show tissue-specific insulin sensitiv- 
ity; muscle and liver have an enhanced insu- 
lin sensitivity, whereas adipose tissue re- 
mains unchanged relative to wild-type mice. 
The effects that were observed in adipose 
tissue could be the result of some compensa- 
tory mechanism such as the up-regulation of 
some other PTP [although no difference was 
observed in the amount of PTP activity be- 
tween wild-type and PTP-lB _/_ tissue ex- 
tracts (11)] or could possibly be due to the 
overall enhanced insulin sensitivity of the 
PTP- IB- deficient animal. 

The data presented identify PTP- IB as 
having a major role in the insulin signaling 
pathway. What this function is remains to be 
clarified, but the simplest explanation would 
be that it dephosphorylates the activated in- 
sulin receptor. Recently, the disruption of the 
leukocyte antigen-related (LAR) PTP, which 
has also been suggested to affect the insulin 
signaling cascade, has been described. The 



targeted mutagenesis of LAR produced mice 
with impaired mammary gland development 
(15) but with blood glucose concentrations 
within the normal range (16). In contrast, the 
LAR-deficient mice generated by insertional 
mutagenesis had body weights that were half 
those of control mice and were insulin resis- 
tant (17). The reason for the difference in 
phenotype between these two LAR-deficient 
mice strains is unknown. 

We have shown that the loss of PTP- IB 
activity causes enhanced insulin sensitivity 
and resistance to weight gain in mice. These 
results make PTP- IB a potential therapeutic 
target for the treatment of type 2 diabetes and 
obesity. 
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A ACE Hyperandrogenism Guidelines 



AMERICAN ASSOCIATION OF CLINICAL ENDOCRINOLOGISTS 
MEDICAL GUIDELINES FOR CLINICAL PRACTICE 
FOR THE DIAGNOSIS AND TREATMENT 
OF HYPERANDROGENIC DISORDERS 



MISSION STATEMENT 

The purpose of this document is to provide guidelines 
for the diagnosis and treatment of hyperandrogenic disor- 
ders in women. These may range from simple hirsutism, 
without clearly demonstrable biochemical features of 
hyperandrogenism, to frank virilization. The symptoms 
and signs include those that involve the pilosebaceous unit 
(PSU) — that is, hirsutism, acne, and alopecia — and those 
that involve the female reproductive system — amenorrhea 
and infertility. Hyperandrogenism is also a forerunner of 
serious cardiovascular problems (for example, hyperten- 
sion, microvascular disease, and dyslipidemias) and other 
metabolic disorders (such as type 2 diabetes mellitus). 
Thus, clinical endocrinologists seem optimally qualified 
to address these disorders. 

These guidelines are not to be considered an exten- 
sive review of the literature on this topic or an exhaustive 
analysis of recent advances in this field. They are intend- 
ed to be what the word guidelines implies: a brief summa- 
ry of the accepted scientific information and views on this 
topic as well as suggestions for the diagnosis and treat- 
ment of these disorders. The presented material is in a 
form that can be easily used and applied to practical clin- 
ical situations encountered by endocrinologists. 

INTRODUCTION 

"Hyperandrogenism" is a term used to describe the 
most common clinical signs in women with hyperandro- 
genemia: hirsutism, acne, and alopecia. It is also the 
hyperandrogenic state that drives the various other patho- 
logic conditions in a wide range of tissues and organ 
systems. The most commonly diagnosed conditions 
associated with hyperandrogenism in reproductive-age 
women are ovulatory disorders and polycystic ovary syn- 
drome (PCOS). PCOS has been estimated to affect 5 to 
10% of women in this age-group (1). The manifestations 
of hyperandrogenism are detected and frequently classi- 
fied relative to the medical specialty practiced by the 
attending physician. For example, the dermatologist may 
note some of the cutaneous manifestations (acne, 
hirsutism, and alopecia), the gynecologist addresses the 
menstrual dysfunction (amenorrhea, oligomenorrhea, 
menorrhagia, and metrorrhagia) and the underlying 



ovarian dysfunction (anovulation, oligo-ovulation, pelvic 
pain, ovarian cysts, and infertility), the pediatrician deals 
with associated congenital adrenal hyperplasia (CAH) or 
ambiguous genitalia, the internist diagnoses the dyslipid- 
emias, hypertension, and impaired glucose tolerance that 
may be associated with long-standing hyperandrogene- 
mia, and the endocrinologist usually encounters patients 
with hyperandrogenism who have symptoms and signs of 
hirsutism, acne, and insulin resistance. 

Regardless of which specialty assumes the responsi- 
bility for the management of the patient's condition, the 
disease is an endocrine disorder — a hyperandrogenic 
state — that may precipitate pathologic events in various 
organ systems (for example, PSU activation, ovulatory 
dysfunction, ovarian cysts, menstrual dysfunction, insulin 
resistance, infertility, and cardiovascular disorders). The 
disease is usually caused by excessive androgen produc- 
tion by the ovaries, the adrenal glands, or both. It can also 
be attributable to an abnormality of the androgen receptor 
mechanisms in the target cell (the incidence of this abnor- 
mality is unknown). Hyperandrogenism caused by abnor- 
malities in peripheral metabolism of steroids is also rare. 

These guidelines deal primarily with the following 
factors: 

• Symptoms of hyperandrogenism 

• Diagnostic evaluation of hyperandrogenism 

• Determination of the site (or sites) of excess androgen 
production 

• Determination of pathologic and pathophysiologic 
changes at the site of androgen hypersecretion 

• Therapy for hyperandrogenism 

SYMPTOMS OF HYPERANDROGENISM 
The Pilosebaceous Unit 

Acne 

Acne is commonly present in female adolescents. It 
occurs in almost 50% of teenage subjects. Persistence of 
acne into the late teens or 20s, however, should alert the 
pediatrician or the endocrinologist to the possibility of 
hyperandrogenism. This scenario is particularly likely if 
the acne is resistant to the usual dermatologic treatment 
strategies and is associated with hirsutism or menstrual 
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dysfunction. Under these circumstances, acne should be 
considered a sign of hyperandrogenism that necessitates 
appropriate diagnostic investigation (2,3). The variability 
in the genetic susceptibility of the PSU to androgen stim- 
ulation is associated with variations in the clinical expres- 
sion of the hyperandrogenism (4). Women with acne alone 
may have levels of plasma testosterone as high as those 
with hirsutism, with or without acne. Similarly, no corre- 
lation exists between the severity of acne and the plasma 
free testosterone levels (5). As with other manifestations 
of the response of the PSU to hyperandrogenism, persis- 
tent acne should prompt an investigation of other andro- 
gen-related disorders. Of note, ovulatory dysfunction is 
common in young women with acne (6). In one study of 
women seen in consultation primarily for acne, 45% of 
cases were associated with polycystic ovaries (7). 

Hirsutism 

Hirsutism is defined as excessive hair growth in 
women, occurring in anatomic areas where the hair folli- 
cles are most androgen sensitive. Hirsutism manifests as 
an excessive or inappropriate development and growth of 
the PSU (6). Androgens cause transformation of vellus 
(fine, soft, unpigmented) hair to terminal hair in androgen- 
dependent areas of hair growth. 

The amount, distribution, and progression of human 
body hair have racial, familial, genetic, and hormonal 
influences. The massive hairiness of aboriginal Ainu in 
Japan is considered a sign of racial purity. Extreme genetic 



(nonracial) hairiness has been documented in art (8). 
Nordic and Anglo-Saxon European peoples, indigenous 
natives of North and South America, and African 
Americans are generally less hairy than are Mediterranean 
peoples. About one in three non-Scandinavian or non- 
Asian women has some hair on the upper lip, periareolar, 
and suprapubic areas. East Asians tend to be less hairy 
than Euro-Americans, with no difference in testosterone 
levels (9). Hormonal levels are also normal in prepubertal 
simple hypertrichosis (10), which is the appropriate term 
for hairiness of nonhormonal origin, in contrast to hir- 
sutism and its extreme, virilization. Familial hairiness may 
be genetic or hormonal. Age-dependent variation in hairi- 
ness has also been reported (11). 

The presence of substantial numbers of terminal hairs 
on the lower back, sternum, abdomen, shoulders, buttocks, 
and inner thighs is considered abnormal. The degree and 
extent of hirsutism are clinically evaluated by using the 
Ferriman-Gallwey scale (12) or a modification (13,14) 
(Fig. 1 and Table 1). This method is useful for assessment 
and follow-up of the response to therapy. 

The variability in the inherent sensitivity of the PSU 
to androgens is demonstrated by the fact that even though 
plasma testosterone levels may be substantially increased, 
the patient may not exhibit appreciable hirsutism (15). 
Familial and ethnic differences exist in the occurrence and 
the degree of hirsutism. The androgen receptors in the 
PSU are specific for the androgen dihydrotestosterone 
(DHT). The action of 5cc-reductase is necessary to convert 




Fig. 1. Diagram showing anterior (left) and posterior (right) sites of involvement in assessment of 
degree and extent of hirsutism on the basis of the Ferriman-Gallwey scale. See Table 1 for defini- 
tions of hair gradings at each site. Modified from Ferriman D, Gallwey JD. Journal of Clinical 
Endocrinology & Metabolism. 1961;21:1442-1443. Table 1. ©The Endocrine Society. 



AACE Hyperandrogenism Guidelines, Endocr Pract. 2001;7(No. 2) 123 



Table 1 

Definition of Hair Gradings at Each of 19 Sites (Ferriman-Gallwey Scale)* 



Site Grade Definition 



! 



l upper lip 


1 


A few hairs at outer margin 




r\ 
Z 


A small mustache at outer margin 




3 


A mustache extending halfway from outer margin 




A 

*T 


r\ IIlUMaVHv CAlCUUlIlg LU uuuilllv 


2 Chin 


1 
1 


A few scattered hairs 




Z 


Scattered hairs with small concentrations 




1 Rr A 


v^uiiipicic cover, ugiu oiiu neavy, icbjjct^u vviy 


3 Mdeburns 


1 
1 


Few nonterminal hairs 




2 


More nonterminal hairs 




J 


TVufni-nal hair /\t\ cirta rtf 

i ci nundi .nair on aiue ui lace 




A 

*T 


Tprrriinal hair ^vtAnHitio fit tnnnHihlf* 
IClIlllIlal Hall CAlCIlUIIIg lO IllallUIUIC 


4 INcCK 


1 
1 


TCW Hallo OH IICCK. 




9 
Z 


lviore nans on newv 






v^ompieie cover, ugnt ana neavy, respectively 


j L-neSt 


1 
1 


i^ircumareoiar ndirs 




z 


With midline hair in addition 




3 


Fusion of these areas, with three -quarter cover 




4 


Complete cover 


6 Upper back 


i 
1 


A few scattered hairs 


9 
Z 


ivduici more, sun scatieicu 




3 & *f 


L/Ompieie cover, ngm ana neavy, respectively 


7 Lower back 


1 
i 


a. sac nil run 01 nair 




Z 


witn some lateral extension 




^ Mr A 


i ill cc-UjUai ici wovci oi coiiiuieic tuvci, icspcv/iivciy 


5 oUttOCKS 


1 Ot Z 


rew or many nairs, respectively, over lower duiiock.s 




1 Mr A 


Hair pvtAfiHina trv nnnpr hiittrmt"c li&Vit tin H lif»nw rf»cr»f»pH\/f*1\/ 

nair eAieuumg to upper ouuocas, ugnt aiiu neavy, respectively 


• 

9 Upper abdomen 


1 


A few midline hairs 


9 
Z 


rvaiiicr more, sun miumie mvoivciiieui 




J Ot H 


nail aiiu lull Luvci, icojjcciivciy 


10 T nwpr flHHnmpn 

iv i—t\J vwi ai/vivuii^/ti 


1 


A few midline hairs 




2 


A midline streak of hair 




3 


A midline band of hair 




4 


An inverted V-shaped growth 


1 1 Tneuinal ares. 

1 1 -111 £2 14*1 J (41 Ul VU 


1 


Pubic hair extending to inguinal area 




2 


A few hairs below inguinal area 




3 &4 


Complete cover below inguinal area, light and heavy, respectively 


12 Perianal area 


1 


Hair encircling introitus and anus 




2 


Hair extending to inner thigh 




3 & 4 


Hair on inner thigh and buttocks, light and heavy, respectively 


13 Arm 


1 


Sparse growth affecting no more than a quarter of the limb surface 




2 


More than this; cover still incomplete 




3&4 


Complete cover, light and heavy, respectively 


14 Forearm 


1,2,3,4 


As for arm 


15 Thigh 


1,2,3,4 


As for arm 


16 Leg 


1,2, 3,4 


As for arm 


17 Foot 


1 


A few hairs on dorsum of foot 




2 


More hair on dorsum of foot 




3&4 


Hair over one-half or three-quarters or more, respectively, of dorsui 


18 Toes 


1 &2 


Few hairs or many hairs, respectively, on big toe 




3 & 4 


Few hairs or many hairs, respectively, on other toes 


19 Fingers 


1 


Few hairs on proximal phalanx — dorsal surface 




2 


Many hairs on proximal phalanx— dorsal surface 




3 


Few hairs on 2nd phalanx— dorsal surface 




4 


Many hairs on 2nd phalanx— dorsal surface 



Total score 



♦See Figure 1 for depiction of sites. Grade 0 at all sites indicates absence of terminal hair. 
Modified from Ferriman and Gallwey (12). ©The. Endocrine Society. 
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testosterone to DHT in the hair follicle. Women with hir- 
sutism may have increased levels of 5oc-reductase. 
Measurement of plasma DHT, however, is not useful in 
assessing hirsutism, acne, or androgenetic alopecia (16). 

The rapid development of hirsutism may be associat- 
ed with severe hyperandrogenemia, other PSU changes, 
and ovarian dysfunction. Rapid onset and very high andro- 
gen levels should alert the clinician to the possibility of a 
neoplasm (adrenal or ovarian). 

Androgenetic Alopecia 

Alopecia or hair loss may be a clinical result of andro- 
gen excess. Fifteen percent of reproductive-age women 
with the manifestation of alopecia and no other signs of 
hyperandrogenism have hyperandrogenemia (17). Typi- 
cally, the hair loss occurs at the vertex, but it may affect 
the crown later and eventually produce a diffuse pattern of 
hair loss. With the pronounced hyperandrogenemia found 
in virilizing states, one may see the typical male pattern 
balding including bitemporal hair loss. The latter finding 
is often associated with severe cutaneous manifestations 
of androgen excess, clitorimegaly, and menstrual dysfunc- 
tion including amenorrhea. 

Other factors such as genetic predisposition, nutri- 
tional deficiencies, recent weight loss, anemia, and thyroid 
dysfunction are responsible for the alopecia in many 
women without hyperandrogenemia. Certain drugs, 
including danazol, anabolic agents, and isotretinoin, may 
also cause hair loss. 

Virilization 

Virilization, characterized by clitoral hypertrophy, 
deepening of the voice, androgenic muscle development, 
breast atrophy, severe hirsutism, male pattern baldness, 
and masculine habitus, is associated with severe hyperan- 
drogenemia attributable to adrenal or ovarian tumors, 
hyperthecosis, or CAH. A rapidly progressing syndrome 
of androgen excess, usually with pronounced oligomenor- 
rhea or amenorrhea, should prompt the clinician to suspect 
a virilizing state (18). 

Ovulatory Dysfunction 

Women with hyperandrogenemia have various 
degrees of ovulatory dysfunction (19), which may lead to 
infertility (20,21). The ovulatory abnormalities are 
brought to the physician's attention and expressed clini- 
cally as menstrual dysfunction — for example, oligomenor- 
rhea, amenorrhea, menorrhagia, metrorrhagia, pelvic pain, 
premenstrual dysphoric syndrome, and disturbances in fer- 
tility. The manifestations of menstrual dysfunction range 
from sporadic episodes of oligo-ovulation or anovulation 
to amenorrhea. Typically, women with ovulatory dysfunc- 
tion associated with hyperandrogenemia can have a nor- 
mal or delayed menarche (22), which is frequently fol- 
lowed by irregular menses and episodes of amenorrhea. 

In addition to hyperandrogenemia, other factors may 
have an influence on ovulatory activity. Examples include 
obesity, eating disorders (manifested by various forms of 
anorexia or bulimia), hyperprolactinemia, hypothalamic 



dysfunction, and psychopharmacologic therapy. All these 
conditions may be associated with severe menstrual dys- 
function, including amenorrhea, and must be excluded. At 
initial assessment, a considerable proportion of women 
with hyperandrogenism may have regular menses (at 4- 
week intervals) (6). Some, however, may have poor ovu- 
latory activity, prolonged follicular phase, shortened luteal 
phase, or anovulation (20). Indeed, spontaneous ovulation 
and pregnancies may also occur (23). Nevertheless, fecun- 
dity, the average time for pregnancy to occur, may be 
delayed, and the incidence of spontaneous miscarriages is 
increased (24). 

Metabolic and Cardiovascular Consequences of 
Hyperandrogenism 

The relationship between the most common condition 
associated with hyperandrogenism — PCOS — and long- 
term metabolic disorders such as insulin resistance, type 2 
diabetes mellitus, and dyslipidemias is now well recog- 
nized (25,26). Hypertension may also be associated with 
hyperandrogenemia (27). Recent data clearly demonstrate 
the role of hyperinsulinemia and insulin resistance in 
PCOS (28); these factors have been implicated in the 
development of atherosclerosis and a predisposition to 
coronary artery disease and type 2 diabetes mellitus (29). 
Impaired glucose tolerance and type 2 diabetes mellitus 
are found in 40 to 45% of patients with PCOS at the time 
of initial examination (30,31). 

Frequently, the clinical and biochemical findings of 
hyperandrogenism in women are forerunners of metabolic 
disturbances that have a major effect on their health (32). 
The association with obesity also potentiates the probabil- 
ity of their occurrence. Some studies have also found the 
presence of hyperinsulinemia and insulin resistance in 
lean women with PCOS (33), particularly those with 
oligomenorrhea. 

The finding of increased low-density lipoprotein cho- 
lesterol concentrations in most obese and some lean 
women with PCOS (30) provides a metabolic basis for the 
report of increased atheroma in hirsute compared with 
nonhirsute women undergoing angiography (34). 

The focus is shifting to the possibility of thwarting the 
metabolic complications by early diagnosis and treatment 
of PCOS. Therapeutic strategies may include insulin-sen- 
sitizing agents and other modalities together with weight 
reduction in obese women with PCOS, which may 
decrease the risks of the dysmetabolic syndrome, also 
called syndrome X and the insulin resistance syndrome 
(35). The features of the dysmetabolic syndrome are listed 
in Table 2. To date, no large published series have defined 
the natural history of PCOS, but it is prudent that, with the 
data available, a vigorous attempt be made to reduce the 
risk factors for possible development of dysmetabolic 
syndrome. 

Psychologic Dysfunction 

Cystic acne, hirsutism, and alopecia, the clinical 
expressions of hyperandrogenemia, can have a devastating 
psychosocial effect in young girls and women of repro- 
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Table 2 

Components of the Dysmetabolic Syndrome* 

Dyslipidemia 

Triglycerides >140 mg/dL or high-density lipoprotein cholesterol <40 mg/dL or 
Low-density lipoprotein cholesterol particle size <260 angstroms (26 nm) 

Insulin resistance 

Fasting plasma glucose >1 10 mg/dL or 
Type 2 diabetes mellitus 

Obesity 

Body mass index >25 kg/m 2 or 
Waist-to-hip ratio >0.85 or 
Waist circumference >100 cm 

Hypertension 

Blood pressure 140/90 mm Hg 

Other abnormalities 

High level of serum uric acid 

High level of plasminogen activator inhibitor- 1 

*A diagnosis of dysmetabolic syndrome requires the presence of criteria for at least 2 of 
the first 3 components. 



ductive age. These manifestations may be associated with 
severe anxiety and depression. In addition, symptoms 
resembling those frequently classified as part of the pre- 
menstrual syndrome, such as dysphoria, are frequent com- 
plaints of patients with ovulatory dysfunction, particularly 
when associated with hyperandrogenemia (36). 

The occurrence of obesity in conjunction with 
hyperandrogenism can have a further negative effect on 
self-esteem and self-image. The fear of social rejection 
can make some women reclusive and may retard their 
development of social skills and confidence. Such women 
are usually helped considerably once an accurate 
diagnosis of the underlying endocrine and ovarian disor- 
ders is obtained. Correction of the underlying pathophysi- 
ologic condition can help ameliorate the psychologic 
dysfunction. 

DIAGNOSTIC EVALUATION OF 
HYPERANDROGENISM 

In most cases, the symptoms usually associated with 
hyperandrogenism (hirsutism, menstrual dysfunction, and 
infertility) are not brought to the attention of a physician 
for evaluation until the patient is in her late teens, early 
20s, or later. Clearly, the most common causes of hyper- 
androgenism begin in early adolescence; therefore, every 
attempt should be made to diagnose and treat these condi- 
tions as early as possible. The diagnostic responsibility 
lies with the primary-care physicians, pediatricians, and 
gynecologists, particularly those who deal with "adoles- 
cent gynecology," who first come in contact with these 
young patients (37,38). 



The diagnostic evaluation is designed to provide 
information about the specific androgens involved (for 
example, testosterone, free testosterone, and dehy- 
droepiandrosterone sulfate [DHEAS]), the degree of 
hypersecretion, the organs of origin of the androgen 
excess (such as the ovaries or adrenal glands), and the 
pathogenesis of the excess androgen production. 
Furthermore, the evaluation must determine whether the 
excessive production of the androgens is due to organ dys- 
function, hyperplasia, or a neoplasm. In addition, the 
degree of the effect of the hyperandrogenism on the 
integument, reproductive system, cardiovascular system, 
and metabolic function should be determined. 

History and Physical Examination 

A thorough history and physical examination provide 
the most important initial diagnostic information, whereas 
laboratory tests should usually serve to confirm the pres- 
ence of hyperandrogenemia. The history should include 
information about the patient's age at thelarche, adrenar- 
che, and menarche. If obesity is present, the time of onset 
and the progression should be noted. The character of the 
menstrual cycles (their frequency, duration, and occur- 
rence of dysmenorrhea) along with the reproductive histo- 
ry, including miscarriages, should be evaluated. The 
patient should be questioned about the age at onset and the 
progression of the following factors: hirsutism, acne, 
excessive sebum, seborrhea, and alopecia. One should also 
note medications used and their effects on acne and hir- 
sutism. The family history is important in determining 
whether other family members have hirsutism, acne, 
infertility, diabetes mellitus, cardiovascular disease, 
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dyslipidemia, or obesity. The presence of premature bald- 
ing (<35 years old) in male siblings of women with hyper- 
androgenism has been described (39). 

A complete physical examination including pelvic 
examination or pelvic ultrasonography (or both) is 
required. In addition to the usual vital signs, the measure- 
ments of height, weight, and waist circumference and the 
determination of the body mass index and the waist-to-hip 
ratio (WHR) are essential (40). The WHR (normal 
women, <0.8) and body mass index are important in 
assessing the degree of obesity in women with hyperan- 
drogenism. A longitudinal 4-year follow-up of 32,898 
women who were 55 to 69 years of age and had an 
increased WHR revealed a significantly increased risk of 
mortality attributable to coronary artery disease (41). 

Particular attention should be paid to the degree and 
distribution of cutaneous manifestations of androgen 
excess (hirsutism, acne, and alopecia). The degree of hir- 
sutism can best be documented and graded by using a sys- 
tem such as the one presented in Figure 1 and Table 1. In 
addition, the presence of clitoral hypertrophy and acantho- 
sis nigricans (an epiphenomenon of brown and sometimes 
verrucous hyperpigmentation on the sides and back of the 
neck, axillae, submammary region, subpanniculus areas, 
perineum, or vulva) should be noted. Acanthosis nigricans 
as well as the presence of skin tags, usually in the neck 
area, may be indicators of insulin resistance. Examination 
of the thyroid gland and breasts (presence of galactorrhea) 
should be emphasized. A thorough pelvic examination 
should include an inspection of the vulva for the presence 
of clitoral hypertrophy and of the cervix for the state of 
dilatation of the cervical os and presence and type of cer- 
vical mucus. This last factor may be of considerable help 
in determining the patient's ovulatory status. The uterus 
should be examined for size and the presence of tumors. 
Similarly, the adnexae should be examined for the pres- 
ence of masses. In the obese patient, pelvic examination 
can be imprecise and limited. Under these circumstances, 
pelvic ultrasonography is imperative for identifying patho- 
logic changes in the pelvic organs. 

Although a pelvic ultrasound study demonstrates typ- 
ical subcortical ovarian cysts ranging from 5 to 10 mm and 
having an increased stroma in most women with PCOS, 
these morphologic findings are nonspecific and may be 
found in other diseases or in normal women (42). Similar 
ultrasound patterns can be found in nonclassic adult-onset 
CAH, hyperprolactinemia, and thyroid dysfunction and in 
perimenarchal girls. Thus, the ultrasound picture of a 
polycystic ovary should not be used as a criterion for the 
diagnosis of PCOS. Despite the nonspecificity of pelvic 
ultrasonography, it can reveal ovarian size, the nature of 
ovarian follicles and stroma, the state of the endometrium, 
the response to therapy, and the diagnosis of ovarian neo- 
plasms (primarily dermoids) (43). 

Laboratory Studies 

Although the evaluation of androgen levels and their 
site of secretion in patients with suspected hyperandro- 
genism is essential, other endocrine studies, particularly as 



suggested by the history and physical examination, are in 
order. The determination of luteinizing hormone (LH), 
follicle-stimulating hormone (FSH), prolactin, glucose, 
and lipid levels is of particular interest during the initial 
laboratory evaluation. 

The laboratory evaluation of the patient with clinical 
signs of hyperandrogenism necessitates specific knowl- 
edge and assessment of the endocrine testing being per- 
formed. Special care must be exercised to select a labora- 
tory known to the physician to be proficient in androgen 
determinations — that is, a laboratory that has made an 
effort to obtain appropriately timed blood samples and has 
reliably determined the "normal" or the "reference" ranges 
for the hormones tested. Many commercial laboratories do 
not provide accurate, sensitive, and reproducible hormone 
determinations, particularly for androgens. The most chal- 
lenging problems are encountered with the determination 
of testosterone levels in normal female patients who have 
testosterone levels in the low range of assay detection. 
Many commercial laboratories have inaccurate normal 
ranges for testosterone levels, as evidenced by the fact that 
most research publications report much lower normal - 
range levels for women without hyperandrogenism than 
do commercial laboratories. Studies have shown that the 
same testosterone value measured by different laboratories 
may be designated as normal to abnormally high when 
compared with the supplied reference ranges (44,45). 
Thus, an acceptable clinical laboratory should provide 
precise results, a reliable reference range, and quality con- 
trol that ensures stable values over extended periods. 
These features are crucial because long-term therapy for 
hyperandrogenism is frequently necessary. 

During the initial assessment of a patient with sus- 
pected hyperandrogenism, the following plasma hormone 
concentrations could be determined during fasting in a 
specimen obtained during the first 7 days of the menstrual 
cycle: total and free testosterone, DHEAS, prolactin, LH 
and FSH, and 1 7-hydroxyprogesterone (17-OHP). 

The determination of plasma free testosterone may 
help detect the presence of subtle hyperandrogenemia in 
some patients in whom the total testosterone level is nor- 
mal. Of note, obese women may have a high free testos- 
terone value because of reduced sex hormone-binding 
globulin (SHBG) attributable to hyperinsulinemia (46). A 
high DHEAS level may indicate an adrenal factor in 
androgen production and, if substantially elevated, the 
presence of an adrenocortical neoplasm. The usefulness of 
a high LH level or an elevated LH/FSH ratio (>2.5) in 
diagnosing the presence of polycystic ovarian state in 
women with hyperandrogenism is limited, inasmuch as at 
least a third of the patients with PCOS do not demonstrate 
this abnormality (47). In reference to reproductive func- 
tion, however, a high LH level is associated with a poor 
response to attempted induction of ovulation and an 
increased risk of miscarriage (48). 

Measurement of plasma 17-OHP levels is useful in 
diagnosing adrenal hyperandrogenemia as a result of 21- 
hydroxylase deficiency. A high 17-OHP level suggests the 
presence of this enzyme deficiency disease. With a suspi- 
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cious family history of CAH or in Ashkenazi women, the 
use of a cosyntropin stimulation test may be necessary to 
exclude this enzyme deficiency (49). 

Many clinicians have found that women with clinical 
signs of hyperandrogenism who undergo initial assess- 
ment after the age of 35 years may not demonstrate hyper- 
androgenemia (Futterweit W. Unpublished data). This 
finding may prevail because, with aging, the hyperandro- 
genic activity (particularly that of the ovaries) decreases 
and the hirsutism will persist. Therefore, follow-up moni- 
toring of such women for the development of associated 
metabolic and cardiovascular sequelae is important. 

DETERMINATION OF SITE OF 
ANDROGEN PRODUCTION 

The hypersecretion of androgens can be attributed to 
the ovaries, the adrenal glands, or the peripheral conver- 
sion of androgen precursors. From the standpoint of ther- 
apeutic strategy, determining the degree of contribution of 
the androgens originating from each site is critical. For 
this purpose, several endocrine function tests are avail- 
able. 

Adrenal Suppression Tests 

Two-Day Suppression Test 

The following steps are involved in the 2-day dexa- 
methasone suppression test (50). 

Day 1 

• Beginning at 8 to 9 am, obtain three blood 
samples at 20-minute intervals. Equal aliquots 
of each sample are pooled to provide the base- 
line sample (before administration of dexa- 
methasone). 

• Dispense to the patient eight 0.5-mg dexa- 
methasone tablets. 

• The patient takes one tablet of dexamethasone 
at lunch, dinner, and bedtime. 

Day 2 

• The patient takes one tablet of dexamethasone 
at breakfast, lunch, dinner, and bedtime. 

Day 3 

• The patient takes one tablet of dexamethasone 
at breakfast. 

• Subsequently, beginning within 2 hours, three 
blood samples are obtained every 20 minutes. 
Equal aliquots of each sample are pooled to 
provide the "postdexamethasone" sample. 

• Testosterone, DHEAS, and Cortisol are 
measured on pooled blood samples obtained 
before and after administration of dexa- 
methasone. 



Alternative Suppression Tests 

Some authorities suggest the use of 4 to 6 days of 
suppression with dexamethasone (51,52). These tests are 
conducted in the same fashion as the 2-day suppression 
test. Some investigators believe that such tests yield no 
substantially greater amount of information than the 2-day 
dexamethasone suppression study. 

Interpretations 

If the high level of testosterone is suppressed more 
than 40% and DHEAS is suppressed more than 60% after 
administration of dexamethasone, the source of the 
increased androgen is most likely the adrenal glands (19). 

If the testosterone level fails to be suppressed but 
DHEAS and Cortisol respond, the source of testosterone is 
primarily the ovaries. 

If testosterone suppression is less than 40%, a com- 
bined ovarian and adrenal contribution is responsible for 
the excessive testosterone secretion. 

If lack of androgen suppression is associated with a 
lack of Cortisol suppression, the patient most likely has 
adrenal hyperfunction (for example, Cushing's disorder or 
adrenal cancer) or the patient may have failed to take the 
dexamethasone tablets as directed. Appropriate steps must 
be taken to determine which of these two possibilities is 
the likely explanation. 

Stimulation Tests 

Adrenal Stimulation 

The cosyntropin stimulation test is conducted for 
detection of the various steroidogenic enzyme deficiencies 
of the adrenal glands, predominantly 21 -hydroxylase defi- 
ciency (49). This test is indicated only if a screening morn- 
ing 17-OHP level is above normal or the suspicion of such 
an enzymatic deficiency is high, as in Ashkenazi and 
Hispanic women. A morning plasma Cortisol level and 17- 
OHP level are determined, and 0.25 mg of cosyntropin is 
administered intravenously. Blood sampling is repeated at 
60 minutes. Specific testing for other adrenocortical 
defects may also be performed to exclude the presence of 
3P-ol dehydrogenase and 1 1 P-hydroxylase deficiencies 
(49). 

Ovarian Stimulation 

The gonadotropin-releasing hormone (GnRH) stimu- 
lation test (53) is conducted to confirm the ovarian origin 
of the excessive androgens. This test involves suppression 
of the adrenal glands with dexamethasone and, while the 
adrenal glands are suppressed, stimulation of the ovaries 
with a GnRH agonist (for example, nafarelin). Blood 
samples are collected at various time intervals during a 25- 
hour period. The hypersecretion of 17-OHP is the end- 
point of the test, indicating ovarian involvement. This test 
is useful only in clinical research of PCOS to identify the 
presence of 1 7-hydroxylase dysfunction. 
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DETERMINATION OF DEGREE OF 
ANDROGEN-INDUCED CHANGES 

Integument 

The degree and distribution of acne and hirsutism, as 
well as alopecia when present, should be assessed with use 
of the Ferriman-Gallwey scoring technique (Fig. 1 and 
Table 1) or a similar scale (12-14). 

Ovaries 

Menstrual cycles can be defined as ovulatory or 
anovulatory. A simple method to evaluate ovulation is the 
use of a basal body temperature chart, in which a biphasic 
pattern suggests an ovulatory cycle, whereas a flat pattern 
is consistent with anovulation. A serum progesterone 
level of less than 2 ng/mL after the 21st day of the 
menstrual cycle or after a perceived increase in the basal 
body temperature is also consistent with ovulatory 
dysfunction. 

Metabolic Sequelae 

The metabolic sequelae of hyperandrogenemia are of 
major importance and need to be considered during the 
early assessment of patients with clinical signs of hyper- 
androgenism. The increased incidences of hyperinsulin- 
ism and insulin resistance make these patients susceptible 
to impaired glucose tolerance and, ultimately, overt type 2 
diabetes mellitus. Although the "gold standard" for defin- 
ing insulin resistance (other than a euglycemic-hyperinsu- 
linemic clamp study) is the frequently sampled intra- 
venous glucose tolerance test, fasting morning glucose and 
insulin measurements may be a useful screening test to 
determine the presence of these metabolic abnormalities. 
A ratio of fasting glucose/insulin <4.5 has been proposed 
as an indicator of insulin resistance in such patients (54). 
In a recently published study, ree valuation of this issue 
provided evidence that an effective method to determine 
the presence of insulin resistance is to measure the total 
integrated insulin response to orally administered glucose 
(55). The fasting insulin level correlated highly with this 
more elaborate testing. 

In addition to impaired glucose metabolism, dyslipid- 
emia characterized by reduced high-density lipoprotein 
cholesterol and increased triglyceride levels is frequently 
noted (56). These manifestations place patients with 
hyperandrogenism at an increased risk for later cardiovas- 
cular atherogenic complications. Although treatment of 
hyperandrogenism is important for the alleviation of the 
cutaneous (acne and hirsutism) or menstrual (ovulatory 
dysfunction) abnormalities, a major goal of treatment is 
possible amelioration of the metabolic disorders. The nat- 
ural history of the disease has not been studied in a suffi- 
ciently large series of patients with hyperandrogenism. On 
the basis of the available data, however, these patients are 
clearly at risk, and this factor should be at the forefront of 
therapeutic considerations. 



THERAPY FOR HYPERANDROGENISM 

Although the therapeutic approach for hyperandro- 
genism is usually directed at the primary symptom of the 
patient — acne, hirsutism, alopecia, menstrual dysfunction, 
infertility, or an associated metabolic disorder — the pri- 
mary concern is a thorough diagnostic evaluation to deter- 
mine the degree of hyperandrogenism, the site of excess 
androgen production, and the presence of all pathophysio- 
logic and metabolic manifestations. Early recognition of 
the disease process and timely therapeutic intervention 
should be of foremost concern. Frequently, the disease can 
be diagnosed in the perimenarchal or early postmenarchal 
period of the female patient's development. Early treat- 
ment may prevent serious problems with acne, hirsutism, 
dysfunctional bleeding, and infertility and possibly ame- 
liorate the potential later development of metabolic and 
cardiovascular complications (37). 

Therapy may be categorized on the basis of the results 
of dynamic tests and the pathophysiologic findings: 

• Predominantly glucocorticoid-suppressible hyperan- 
drogenism (probably adrenocortical in origin) (57) 

• Combined ovarian and adrenal hyperandrogenism 

• Predominantly ovarian hyperandrogenism 

• Rare forms of hyperandrogenism: 

Congenital adrenal hyperplasia 
Adrenal tumors 
Ovarian tumors 
Hyperthecosis 

The pharmacologic options in the treatment of the 
aforementioned disorders include the following: 

• Suppression of adrenal androgens (administration of 
glucocorticoids, usually in physiologic doses of dexa- 
methasone or prednisone) 

• Suppression of ovarian androgens (administration of 
female sex steroids, in the form of either birth control 
pills or estrogens and progestins, or administration of 
gonadotropin secretion blocking agents — that is, GnRH 
agonists with "add-back" estrogen therapy) 

• Treatment with antiandrogens (for example, spirono- 
lactone and flutamide) 

• Treatment with insulin-sensitizing agents (metformin 
and thiazolidinediones) 

• Treatment with 5a-reductase-inhibiting agents 
(finasteride) 

• Bromocriptine 

Of importance, treatment with antiandrogens, insulin- 
sensitizing agents, and 5cc-reductase-inhibiting agents is 
not specifically approved by the US Food and Drug 
Administration and is contraindicated during pregnancy. 

Nonpharmacologic interventions for hyperandrogenic 
states include the following: (/) weight reduction, (2) 
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surgical excision of a virilizing adrenal or ovarian tumor, 
and (5) electrocautery in patients with PCOS. 

Treatment With Glucocorticoids 

Adrenal hyperandrogenism responds well to gluco- 
corticoid therapy with prednisone or dexamethasone (58- 
60). Usually, 5 to 7.5 mg of prednisone or 0.25 to 0.5 mg 
of dexamethasone is administered daily after supper for 2 
or 3 months. This treatment frequently results in normal- 
ization of the androgens. If the androgen levels are nor- 
malized, the dose is lowered to 5 mg of prednisone or 0.25 
mg of dexamethasone for 2 to 3 months, at which time the 
dose can be halved or totally discontinued. The androgen 
levels should then be assessed every 3 to 4 months for 1 
year for recurrence of hyperandrogenemia. Of note, the 
possibility of recurrence of high testosterone levels is 
greater than recurrence of high DHEAS levels. In most 
instances, DHEAS levels remain suppressed indefinitely 
after treatment (60). Suppression of androgens usually 
results in sustained amelioration of acne and has a minor 
effect on hirsutism (slowing of growth rate and softening 
of hair). Improvement in ovulatory activity, return of fer- 
tility, and increased sensitivity to the effect of clomiphene 
citrate on induction of ovulation often occur with gluco- 
corticoid therapy (17,58,60). 

The long-term adverse effects of minimal-dose gluco- 
corticoid therapy on bone resorption and dysmetabolic 
syndrome have not been scientifically studied. Routine 
follow-up studies of patients have not shown significant 
effects on glucose or lipid metabolism; however, short- 
acting corticosteroids such as prednisone would be less 
likely to pose such risks in comparison with the use of 
dexamethasone. 

Treatment With Oral Contraceptives 

Oral contraceptives (OCs) are used widely for treat- 
ment of hyperandrogenism (61-63). The use of third-gen- 
eration OCs in the treatment of ovarian hyperandrogenism 
appears promising in that little if any androgenic effect has 
been noted with desogestrel and norgestimate, the pro- 
gestins used in these OC agents (64). Acne and mild hir- 
sutism are often decreased. The combination of OCs and 
antiandrogenic agents, particularly spironolactone, is often 
used in patients with moderate to severe hirsutism or 
alopecia (65). A major benefit is the reduction in the inci- 
dence of endometrial and ovarian cancer in patients who 
use OCs (66). Contraindications to their use may be a his- 
tory of phlebitis, severe migraine, substantial weight gain, 
and the risk of increased insulin resistance. Long-term use 
may mask severe ovulatory dysfunction, which may 
progress to anovulation and amenorrheic states that are 
more resistant to induction of ovulation. 

Treatment With Antiandrogens 

Spironolactone 

Spironolactone is an antiandrogen that competes with 
testosterone and DHT at the androgen receptor level. The 



minimal dose should be 100 mg daily in divided dosage 
and may be increased to 200 mg daily as tolerated. The 
combination of spironolactone and OCs is frequently used 

(67) . The LH suppressive effect of OCs makes this com- 
bination treatment more effective than spironolactone 
monotherapy. This combined drug strategy minimizes the 
frequently noted polymenorrhea when spironolactone is 
used alone. Some of the side effects include light-headed- 
ness, fatigue, mood swings, reduced libido, headaches, 
and mastalgia. In patients with androgenetic alopecia, the 
use of spironolactone is effective in improving the rate of 
hair regrowth and preventing further scalp hair loss. 

Cyproterone Acetate 

Cyproterone acetate has not been approved by the US 
Food and Drug Administration. Reports indicate that this 
progestational antiandrogen is effective in the treatment of 
hyperandrogenism. When administered in conjunction 
with ethinyl estradiol, it is equal or perhaps slightly supe- 
rior to combination therapy with spironolactone and OCs 

(68) . 

Flutamide 

Flutamide, an antiandrogen that blocks androgen 
uptake and nuclear binding, is a very effective drug in 
treating hyperandrogenism, but it has the potential, albeit 
infrequent, adverse effect of fatal hepatotoxicity, 
Flutamide should be used cautiously only in the most 
severe cases resistant to other forms of treatment (69). 
Recent data indicate that a dosage of 250 mg daily may be 
as effective as 250 mg twice a day. 

Cimetidine 

An H 2 blocker, cimetidine has not been found to be 
useful in the treatment of hyperandrogenism and is not 
recommended in this setting. 

Ketoconazole 

Ketoconazole is an antifungal agent that has shown 
some effectiveness in treating symptoms of androgen 
excess. Reports of serious hepatotoxicity, however, have 
made this drug treatment of questionable value. 

Treatment With 5o>Reductase Inhibitors 

Finasteride is a 5a-reductase inhibitor that blocks the 
intracellular conversion of testosterone to DHT; thus, the 
amount of DHT available for interacting with the andro- 
gen receptor is reduced. Finasteride has a predominant 
effect on the type 2 isoenzyme of 5oc-reductase that specif- 
ically affects sebaceous gland activity. Reports of its use 
in women have been limited to those with hirsutism, and it 
seems to be comparable to spironolactone in reducing ana- 
gen hair shaft diameter when administered in a daily dose 
of 5 to 7.5 mg. It is associated with minimal gastrointesti- 
nal side effects, and it does not alter menstrual cyclicity. 
The plasma levels of testosterone may increase during 
treatment, whereas the DHT level decreases (70,71). Of 
utmost importance, the patient should be aware that she 
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must avoid pregnancy during treatment with finasteride 
because of the potential for causing ambiguous genitalia in 
a male fetus. 

Treatment With Insulin-Sensitizing Agents 
The advent of drugs that enhance insulin sensitivity 
and action makes agents such as metformin and the 
thiazolidinediones important in the therapeutic strategy for 
patients with hyperandrogenism and some of the 
associated metabolic disturbances — specifically, insulin 
resistance (72,73). Studies have indicated that reducing 
plasma insulin levels substantially ameliorates the 
hyperandrogenism of PCOS. Insulin-sensitizing agents 
may become a choice for initial treatment of women with 
hyperandrogenism, particularly those with PCOS and 
insulin resistance, obesity, and moderate to severe 
oligomenorrhea. 

Metformin 

Reduction in the manifestations of hyperandrogenism 
and improved menstrual function have been reported with 
administration of metformin. Metformin has been admin- 
istered in conjunction with ovulation-inducing agents, 
such as clomiphene citrate, and has not been reported to be 
teratogenic (74). 

Many studies have demonstrated the most dramatic 
reduction in hyperandrogenism in obese subjects with 
PCOS; hence, the issue has been raised whether weight 
reduction alone accounts for this effect. Questions have 
been posed about the reported efficacy of metformin in 
increasing insulin sensitivity in this type of patient 
because of the small numbers of subjects and questionable 
study designs in most reports dealing with this issue (75). 
Recently, however, a carefully designed study suggested a 
definite positive effect of metformin in a relatively large 
population of patients (76). 

The recommended dosage of metformin for treatment 
of hyperandrogenic states is the initiation of therapy with 
850 mg (one tablet) in the morning with breakfast, and 
then increasing the dosage to 1,700 mg after 2 to 3 weeks 
in divided doses with breakfast and dinner. Alternatively, 
metformin therapy can be initiated at 500 mg with dinner 
and increased to 500 mg twice and three times daily or to 
1 ,000 mg twice a day as tolerated. The most common side 
effects are gastrointestinal; they consist of bloating, nau- 
sea, vomiting, and diarrhea and frequently occur during 
initiation of treatment. The occurrence of lactic acidosis is 
extremely rare and is more likely in those patients with 
renal impairment. Use of metformin should be discontin- 
ued before administration of contrast dye, particularly in 
subjects with decreased renal function (77). 

Thiazolidinediones 

Thiazolidinediones have been used alone or in combi- 
nation in the treatment of type 2 diabetes mellitus. The pri- 
mary effect of these drugs is achieved by improving 
insulin sensitivity in muscle and adipose tissue as well as 
inhibiting hepatic gluconeogenesis. The decrease in 



insulin resistance is accompanied by a reduction in 
hyperinsulinemia, without associated weight changes. 
Studies have indicated that the use of troglitazone 
improved insulin sensitivity and subsequently decreased 
insulin-mediated ovarian androgen excess in women with 
PCOS, even in those subjects with severe obesity (78,79). 
In view of the occasional reports of serious hepatic distur- 
bances associated with its use, however, troglitazone was 
removed from the market. Other agents in the thiazo- 
lidinedione class of drugs may prove to be effective in the 
treatment of PCOS, but to date, no scientific studies have 
been published of their use in PCOS. 

Comment 

Overall, the use of insulin-sensitizing agents has 
demonstrated clinical improvement in hirsutism and men- 
strual cyclicity. Improvement in responsiveness to ovula- 
tion-inducing agents (specifically, clomiphene citrate) has 
been reported (74). The effects on improvement of the 
cutaneous manifestations of hyperandrogenism may not 
be as pronounced as those achieved with anti androgen 
treatment in combination with OCs. 

Further studies are needed to determine the exact role 
of metformin and thiazolidinediones as therapeutic agents, 
or possibly initial monotherapy, in the treatment of hyper- 
androgenic states, including PCOS. 

Treatment With GnRH Agonists 

The use of GnRH agonists is most effective in severe 
forms of ovarian hyperandrogenism. Depot preparations 
of GnRH agonists may be administered at monthly inter- 
vals. Because of the severe hypoestrogenemia induced by 
GnRH agonists, however, a concurrent add-back therapy 
with estrogen and progesterone (the latter when the uterus 
is present) is essential (80,81). This therapy will also cor- 
rect the severe vasomotor symptoms and other side effects 
of the resulting hypoestrogenemia, GnRH agonists have 
shown no effect in reducing hyperinsulinism in instances 
of ovarian hyperandrogenism. 

Treatment With Bromocriptine 

The use of bromocriptine, a dopamine receptor ago- 
nist, in a divided dosage of 5 to 7.5 mg daily with meals is 
indicated in the subset of women with hyperandrogenism 
who have hyperprolactinemia. No convincing data indi- 
cate that this treatment is effective in women with hyper- 
androgenism who do not have high levels of circulating 
prolactin (78,79). Bromocriptine improves menstrual 
cyclicity in patients with hyperprolactinemia who have 
PCOS (82) and may reduce some of the associated hir- 
sutism, which is related to augmented production of ad- 
renal androgen attributable to the hyperprolactinemia. 
Treatment with bromocriptine should be initiated gradual- 
ly so as to minimize initial light-headedness, hypotension, 
and nausea (vaginal or rectal administration can also 
reduce symptoms). Alternatively, use of cabergoline in 
dosages of 0.5 mg weekly or twice weekly may be associ- 
ated with fewer side effects. 
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Weight Reduction 

Obesity is present in 55 to 65% of patients with 
PCOS, the most common form of hyperandrogenism, and 
is associated with an increase in the WHR and a history of 
perimenarchal onset. Obesity is frequently associated with 
insulin resistance and hyperinsulinism, which act syner- 
gistically with LH to intensify ovarian hyperandrogenism, 
decrease hepatic production of SHBG, and reduce insulin- 
like growth factor binding protein- 1 (83). Moreover, obe- 
sity reinforces the genetic predisposition to hormonal and 
ovulatory disorders in PCOS. Metabolic complications are 
more common in obese patients with PCOS (impaired glu- 
cose tolerance in approximately 40% of subjects with obe- 
sity, hypertension, dyslipidemias, and possible develop- 
ment of estrogen-dependent tumors) than in lean women 
with PCOS. 

Weight loss in patients with hyperandrogenism, with 
or without the clinical presence of PCOS, should be the 
first therapeutic option because it decreases androgen lev- 
els, increases SHBG, and may restore ovulation. As little 
as a 7% reduction in body weight can restore fertility, 
decrease hirsutism in some women with androgen excess, 
and improve the response to induction of ovulation 
(84,85). Investigators have also demonstrated that abnor- 
malities in 17,20-lyase activity diminish in parallel with 
reduction of hyperinsulinemia (86). 

Individualization of Therapy 

The choice of therapeutic intervention for hyperan- 
drogenism depends on several factors: 

• The source of the hyperandrogenism 

• The goal of therapy 

• The long-term risks and benefits 

The evaluation and treatment of hyperandrogenism 
and hyperandrogenemia should begin as early as possible 
after the onset of symptoms. Often, mild symptoms, such 
as menstrual irregularities, hirsutism, and acne, are over- 
looked or discounted, and the result is their progression. 
For example, adolescent patients diagnosed with a hyper- 
androgenic disorder should undergo assessment for treat- 
ment based on the cause of the hyperandrogenism, the 
long-term goals, and the potential benefits and risks. 

For those patients with glucocorticoid-suppressible 
hyperandrogenism, successful treatment for 1 to 2 years 
may yield a long-term remission and prevent metabolic 
and ovulatory disorders (60). 

In non-glucocorticoid-suppressible hyperandro- 
genism, the use of an OC-anti androgen combination may 
be effective in treating the hyperandrogenemia but may 
not ameliorate the long-term metabolic and ovulatory 
abnormalities. Therefore, patients should be counseled 
that discontinuation of therapy may lead to recurrence of 
the hyperandrogenic problems. 

Insulin-sensitizing drugs have been shown to be 
effective in short-term studies, but long-term outcomes are 
as yet unknown. Those patients, regardless of age, with 
severe forms of PCOS, such as those with Jnsulin 



resistance, obesity, and acanthosis nigricans, are candi- 
dates for this therapeutic modality. 

CONCLUSION 

The primary responsibility of the physician in the 
assessment of a patient with symptoms associated with 
hyperandrogenism — for example, acne, hirsutism, repro- 
ductive disorders, or metabolic diseases — must be to 
determine the presence of high levels of androgens and 
their source. Careful attention should be focused on 
whether damage to organ systems other than the one 
responsible for the patient's major complaint is also pres- 
ent. Early diagnosis is critical to an effective therapeutic 
strategy. The adolescent patient with hyperandrogenism 
may not undergo thorough assessment before being treat- 
ed symptomatically for irregular menses, hirsutism, or 
acne. Many young women are given oral contraceptives to 
"regulate menstrual cycles" or to "improve acne" without 
a careful determination of the source of the hyperandro- 
genism and selection of the most appropriate therapy for 
maximizing the long-term benefits (37,38), 

Adequate diagnostic evaluation hinges on the avail- 
ability of a laboratory capable of accurate measurement of 
low levels of androgens, particularly testosterone, and the 
ability of the laboratory to provide reproducible results. 
The physician should ensure that the laboratory perform- 
ing these highly specialized hormone assays provides 
accurate and reproducible results. 

The initial goal of therapy is to lower the androgen 
levels, which should occur within the first 2 or 3 months 
of therapy. Once this goal has been achieved, evaluation of 
the degree of the return of function in organs originally 
impaired by hyperandrogenism should be undertaken. 
Then, if necessary, direct therapy toward further improve- 
ment of these functions can be instituted. 

Management of non-tumor-related hyperandrogene- 
mia and polycystic ovaries necessitates careful long-range 
monitoring by an endocrinologist for proper maintenance 
of a euandrogenemic state. Vigilance must be exercised to 
detect any signs of development of the metabolic conse- 
quences of hyperandrogenism, including diabetes melli- 
tus, hypertension, dyslipidemias, or atherosclerosis. 
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The objective of the present study was to estimate the prev- 
alence of the different pathological conditions causing clini- 
cally evident androgen excess and to document the degree of 
long-term success of suppressive and/or antiandrogen hor- 
monal therapy in a large consecutive population of patients. 

All patients presenting for evaluation of symptoms poten- 
tially related to androgen excess between October 1987 and 
June 2002 were evaluated, and the data were maintained pro- 
spectively in a computerized database. For the assessment of 
therapeutic response, a retrospective review of the medical 
chart was performed, after the exclusion of those patients 
seeking fertility therapy only, or with inadequate follow-up or 
poor compliance. 

A total of 1281 consecutive patients were seen during the 
study period. Excluded from analysis were 408 patients in 
whom we were unable to evaluate hormonal status, determine 
ovulatory status, or find any evidence of androgen excess. 
In the remaining population of 873 patients, the unbiased 
prevalence of an dro gen-secreting neoplasms was 0.2%, 21- 
hydroxylase-deficient classic adrenal hyperplasia (CAH) was 
0.6%, 21-hydroxylase-deficient nonclassic adrenal hyperpla- 
sia (NCAH) was 1.6%, hyperandrogenic insulin-resistant 



acanthosis nigricans (HAIRAN) syndrome was 3.1%, idio- 
pathic hirsutism was 4.7%, and polycystic ovary syndrome 
(PCOS) was 82.0%. Fifty-nine (6.75%) patients had elevated 
androgen levels and hirsutism but normal ovulation. A total of 
257 patients were included in the assessment of the response 
to hormonal therapy. The mean duration of follow-up was 33.5 
months (range, 6-155). Hirsutism improved in 86%, menstrual 
dysfunction in 80%, acne in 81%, and hair loss in 33% of pa- 
tients. The major side effects noted were irregular vaginal 
bleeding (16.1%), nausea (13.0%), and headaches (12.6%); only 
36.6% of patients never complained of side effects. 

In this large study of consecutive patients presenting with 
clinically evident androgen excess, specific identifiable dis- 
orders (NCAH, CAH, HAIRAN syndrome, and androgen-se- 
creting neoplasms) were observed in approximately 7% of sub- 
jects, whereas functional androgen excess, principally PCOS, 
was observed in the remainder. Hirsutism, menstrual dys- 
function, or acne, but not alopecia, improved in the majority 
of patients treated with a combination suppressive therapy; 
although more than 60% experienced side effects. (J Clin En- 
docrinol Metab 89: 453-462, 2004) 



ANDROGEN EXCESS IS one of the most common en- 
docrine disorders of reproductive-aged women, af- 
fecting approximately 7% of this population (1-3). Androgen 
excess results in the development of androgenic features in 
the women affected, with the development of hirsutism, 
androgenic alopecia, acne, ovulatory dysfunction, and, if 
extreme and prolonged, even virilization and masculiniza- 
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tion. Disorders that result in androgen excess include specific 
identifiable disorders {i.e. disorders of inclusion) such as 
non-classic adrenal hyperplasia (NCAH), hyperandrogenic 
insulin-resistant acanthosis nigricans (HAIRAN) syndrome, 
and androgen-secreting neoplasms (ASNs). Alternatively, a 
number of androgen excess disorders are diagnosed by ex- 
clusion, such as the polycystic ovary syndrome (PCOS) and 
idiopathic hirsutism (IH), termed disorders of functional an- 
drogen excess (FAE). 

Notwithstanding their clinical importance, the prevalence 
of the different pathological conditions causing or associated 
with androgen excess remains unclear. Although the most 
recognizable clinical feature of androgen excess may be hir- 
sutism, it should be noted that not all patients with hirsutism 
have overt evidence of androgen excess, with some women 
suffering from what we understand to be IH (4). Alterna- 
tively, not all patients with an androgen excess disorder have 
hirsutism, as in the Asian patient with PCOS (5). However, 
in most studies, hirsutism has been used as a surrogate 
marker for androgen excess, with the largest study to date of 
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consecutive patients potentially affected by androgen excess 
including only subjects presenting with hirsutism and /or 
androgenic alopecia (6). This has made it difficult to accu- 
rately assess the frequency of the lesser common androgen 
excess disorders. 

The response of hirsutism, ovulatory dysfunction, and 
other features of androgen excess to hormonal therapy is 
important to determine but has been studied primarily with 
the use of individual agents. However, combination therapy 
including oral contraceptives (OCs), antiandrogens, or met- 
formin has been suggested to be superior to monotherapy 
(7-9). Unfortunately, reports evaluating the results of these 
regimens have generally included 50 or fewer patients (7-13), 
limiting our assessment of the efficacy of this therapeutic 
regimen. 

The objective of the present study was to report on our 
experience evaluating over 1000 consecutive patients con- 
sulting for symptoms potentially related to androgen excess. 
From this data, we have estimated the prevalence of the 
different pathological conditions causing clinically evident 
androgen excess and further documented the long-term suc- 
cess of suppressive and /or antiandrogen hormonal therapy 
in the treatment of these patients. 

Subjects and Methods 

Subjects 

All patients presenting for the evaluation of symptoms potentially 
related to androgen excess to the reproductive endocrinology clinic (to 
R.A.) at the University of Alabama at Birmingham between October 1987 
and June 2002 were included. The Institutional Review Board for Human 
Use at the University of Alabama at Birmingham approved the study. 
Patients evaluated included those presenting with oligo /amenorrhea, 
ovulatory dysfunction, excess hair growth, virilization, alopecia, or acne. 
The data were recorded and maintained prospectively in a computer- 
ized database (Alpha Four version 6.0; Alpha Software Corp., Burling- 
ton, MA). 

Initial subject evaluation 

All patients completed a uniform history form and underwent a 
complete physical examination. The following parameters were re- 
corded prospectively: height, weight, race, age, gravidity, parity, degree 
of ovulatory /menstrual dysfunction, presence of acne, and hirsutism 
score. The body mass index (BMI) was calculated as kilograms per 
square meter. Beginning January 1993, the abdominal and hip circum- 
ferences were assessed as published previously (14), and the waist to hip 
ratio (WHR) was calculated. The presence of acne was recorded, but the 
severity was not generally scored. Excess body and facial terminal hair 
growth was assessed using a modified Ferriman-Gallwey (mF-G) hir- 
sutism score (15). 

The interval between bleeding episodes was assessed prospectively 
and classified as less than 26 d, 27-34 d, 35-44 d, 45 d to 3 months, and 
more than 3 months duration. This classification was established per the 
reported variations in menstrual cycle length observed by Treloar et al. 
(16). Patients with cycles greater than 35 d or less than 26 d were deemed 
to be oligo/ anovulatory. In addition, beginning January 1995, ovulatory 
function was confirmed in all eumenorrheic (i.e. with a bleeding interval 
of 27-34 d) hirsute women by using the basal body temperature and 
measuring a cycle d 22-24 progesterone (P4) level. A level of P4 greater 
than 12.7 nmol/ liter (4 ng/ml) was considered to represent ovulation, 
and eumenorrheic patients were then classified as "27-34 d plus ovu- 
lation" or "27-34 d plus oligo-ovulation." 

In patients who had not received hormonal therapy for 3 months 
before their initial visit, the serum levels of total testosterone (T), free T, 
dehydroepiandrosterone sulfate (DHEAS), and 17-hydroxyprogester- 
one (17-HP) were obtained and recorded. Blood sampling for androgens 
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was performed without regard to the time of the cycle or the day, 
although an effort was made to measure the 17-HP in the preovulatory 
phase of the menstrual cycle (17). Hyperandrogenemia was defined as 
an androgen value above the 95th percentile of 98 healthy control 
women [i.e. a total T > 2.94 nmol/liter (88 ng/dl), free T > 0.026 
nmol/liter (0.75 ng/dl), or DHEAS > 6.64 jimol/liter (2750 ng/ml)], as 
reported previously (1). 

Patients with a basal level of 17-HP greater than 6.0 nmol/liter (2 
ng/ml) underwent either a repeat 17-HP test or proceeded directly to an 
ACTH stimulation test, as described previously (17). If the repeat 17-HP 
was 6.0 nmol/liter or greater, patients underwent an acute ACTH stim- 
ulation test (see below). Patients with levels of total T above 8.67 nmol/ 
liter (250 ng/dl) on at least two separate occasions, or who were deemed 
to demonstrate clinical features suggestive of an ASN (e.g. virilization), 
underwent a transvaginal sonogram and a computerized tomography 
scan of the adrenals at 5-mm intervals to exclude ovarian and adrenal 
neoplasms, respectively (18). 

Patients with evidence of ovulatory dysfunction also underwent mea- 
surements of serum prolactin and thyroid-stimulating hormone levels 
to exclude a prolactin-secreting adenoma and thyroid dysfunction, re- 
spectively. If clinically suspected, screening for Cushing's syndrome 
was performed by either the overnight dexamethasone suppression test 
(i.e. the measurement of a Cortisol level the morning after the admin- 
istration of 1 mg dexamethasone orally at bedtime) through January 1990 
or by measuring the 24-h urine-free Cortisol content after that date. 

In patients diagnosed with PCOS or HAIRAN syndrome (see below), 
fasting glucose (GLU) and insulin (INS) levels were obtained regularly 
after April 1996. Before those dates, basal and or GLU -stimulated INS 
and GLU levels were obtained selectively, primarily in patients clinically 
suspected of having INS resistance (e.g. by the presence of acanthosis 
nigricans on physical examination). 

Acute ACTH stimulation test 

When indicated (see above), the acute ACTH stimulation test was 
performed as described previously (17). In brief, all studies were started 
between 0730 and 0930 h in the fasting state and scheduled d 3-8 after 
a spontaneous vaginal bleed or after a withdrawal bleed induced by 
either 100 mg P4 in oil im or 300 mg/d oral micronized P4 for 7 d. 
Dexamethasone was not administered before the study. Three baseline 
samples were obtained 15 min apart and mixed (0-min sample). Imme- 
diately afterward, 0.25 mg ACTH-(l-24) (Cortrosyn; Organon Co., Or- 
ange, NJ) was administered iv over 60 sec, and blood was sampled 60 
min later. Both the 0- and 60-min samples were assayed for 17-HP levels. 
If the stimulated 17-HP level was greater than 10 ng/ml, the patient was 
considered to have 21 -hydroxylase (21-OH)-deficient NCAH (17). The 
diagnosis of NCAH was confirmed by the genotyping of CYP21, as 
described previously (17). 

Differential diagnosis 

Based on the above evaluation, two distinct types of androgen excess 
disorders, those with specific diagnoses ascertained by inclusion and 
those diagnosed by exclusion (defined as FAE disorders), were 
identified: 

Specific diagnoses ascertained by inclusion testing: 1) ASNs, diagnosed by 
resection and histopathology of the rumor; 2) 21-OH CAH, diagnosed 
by review of prior records indicating evidence of congenital virilization 
in the presence of dramatically elevated basal levels of 17-HP, generally 
90 nmol/liter (30 ng/ml) or greater (19); 3) 21-OH-deficient NCAH, 
diagnosed by an ACTH-stimulated 17-HP level greater than 10 ng/ml 
and confirmed by the genotyping of CYP21, as described previously (17); 
4) HAIRAN syndrome, diagnosed by a fasting basal INS greater than 80 
/jJU/ml and /or an INS level during a 3-h oral glucose tolerance test 
greater than 300 /xIU/ml, as described previously (20, 21). 

Disorders of FAE, diagnosed by exclusion: 1) PCOS, diagnosed by the 
presence of ovulatory dysfunction in association with hirsutism and /or 
elevated androgen levels, after exclusion of the above specific disorders, 
as recommended by a 1990 National Institutes of Health/National In- 
stitute of Child Health and Human Development-sponsored conference 
on the subject (22); 2) IH, diagnosed by the presence of hirsutism ac- 
companied by normal ovulation and normal androgens levels, after 
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exclusion of the above specific disorders (23); and 3) hyperandrogen- 
emia (HA) plus hirsutism / comprised of individuals with elevated an- 
drogen levels and hirsutism but normal ovulation. 

Assessment of response to suppressive hormonal therapy 

To determine the response to suppressive hormonal therapy in pa- 
tients with androgen excess, we retrospectively reviewed the charts of 
all androgen excess patients seen initially between October 1987 and 
June 2001. Using a uniform form, the documented treatment outcome 
and side effects as assessed by the patient were recorded. Patients 
included were those who had PCOS, HAIRAN, IH, or HA plus hirsutism 
(see above). We excluded those patients: 1) seeking infertility therapy; 
2) with inadequate follow-up (i.e. duration of the follow-up visits was 
less than 6 months, or who did not return after their initial visit); or 3) 
with poor compliance with treatment or follow-up (i.e. who were non- 
compliant in taking the medication according to prescribed directions, 
or who were seen for care at greater than 18-month increments). 

All patients with menstrual or ovulatory dysfunction received OCs 
when possible. Patients with unwanted hair growth and evidence of 
excess facial or body terminal hair growth received spironolactone (SPA) 
(200 mg if mF-G > 8; 100 mg/d if mP-G = 3-7) in combination with the 
OC, to minimize the risks of teratogenicity. SPA was rarely used alone, 
except in the occasional hirsute patient who had previously undergone 
a hysterectomy or tubal ligation. Other treatment regimens were occa- 
sionally used, including glucocorticoids, insulin sensitizers, GnRH an- 
alogs, flutamide, finasteride, and other estrogen-progestin combina- 
tions, alone or in combination; the majority of these were used as part 
of clinical trials (24-26). 

When side effects were identified, they were managed as follows. 
First, if side effects attributable to SPA occurred, the dose of SPA was 
decreased. On occasion, the SPA was changed to flutamide or finas- 
teride. Second, if persistent breakthrough bleeding (BTB) on the OC 
occurred, despite short-term supplementation with an oral estrogen, 
then the OC was changed to one containing 50 jig ethinyl estradiol plus 
1 mg ethyndiol diacetate. Third, if other side effects attributable to the 
OC occurred (e.g. headaches or migraines, weight gain, increased breast 
size, etc.), then the OC was changed to one containing 20 /xg ethinyl 
estradiol or discontinued altogether. Fourth, women who were hyper- 
tensive, smokers more than 35 yr of age, or those who had a previous 
thromboembolic event on OCs/ hormones were considered for cyclic 
progestogen treatment only, potentially in combination with SPA. 

Statistical analysis 

INS resistance and /3-cell function were calculated by the homeostatic 
model assessment method (HOMA-IR and HOMA-%/3-cell, respec- 
tively), as described previously (27). Two-group comparison of contin- 
uous variables was performed using a two-sample t test with adjustment 
for nonconstancy of variance, when necessary. More than two group 
means were compared using the ANOVA with post hoc least squares 
means pairwise comparisons. All categorical end points were compared 
using the statistic or Fisher's exact test, when appropriate. 

Results 

General features of patients excluded from the study 

A total of 1281 consecutive patients were seen during the 
time period of the study. Of these, 408 (31.9%) subjects were 
excluded from analysis (Table 1). In 157 (12.3% of total or 
38.5% of excluded) patients, we were unable to evaluate 
hormonal status generally because they were already on 
hormonal therapy and were unwilling to discontinue the 
medication for the evaluation. In 113 (8.8% of total or 27.7% 
of excluded) patients, we were unable to confirm ovulatory 
status, and 138 (10.8% of total or 33.8% of excluded) women 
did not have evidence of androgen excess. In total, 873 sub- 
jects were included in the study. 

Patients who were excluded from the study were slightly 
older (29.6 ± 15.4 vs. 27.7 ± 8.1 yr; P < 0.02) and of lesser BMI 
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TABLE 1. Patients excluded from study 



Exclusion 


Number 


% of total excluded 


% of total 


patients 


patients 


Unable to evaluate 




38.5 


12,2 


hormonal status 








Initial treatment 


156 






Missing androgen levels 


1 






Unable to determine 




27.7 


8.8 


ovulatory status 








Postmenopausal" 


19 






Premenarchal 


4 






Hysterectomized 6 


4 






Vaginal agenesis 


1 






Missing ovulatory 
assessment 0 


85 












Without androgen excess 




33.8 


10.8 


Normal: 28 


28 






Only HA: 26 


26 






Only oligo: 84 


84 






Total 


408 


100 


32.8 



rt Except for one patient with an ASN. 

6 Hysterectomized, and without information regarding menstrual 
or ovulatory function. 

c All with regular menses, d 26-34 in length, but without having 
ovulation verified by luteal progesterone level. 



(29.1 ± 8.9 vs. 32.9 ± 9.6 kg/m 2 ; P < 0.001, respectively) than 
individuals who were included, although the differences 
were deemed to be of limited clinical significance. There was 
no difference in racial composition between the two groups, 
with subjects that were excluded being 14.3% black, 84.3% 
white, and 1.4% of other races compared with 15.0% black, 
82.6% white, and 2.4% other races among those patients who 
were included. 

Clinical features of androgen excess patients included 
in the study 

Of the 873 patients included in the study 5.1, 16.0, 21.8, 
23.7, 17.4, 11.1, 3.2, and 1.6% were 15 yr old or younger, 
16-20, 21-25, 26-30, 31-35, 36-40, 41-45, and 46 yr old or 
older, respectively. Clinically, 75.5% had hirsutism and 
14.2% had acne, whereas 29.7% complained of infertility. 
Patients with hirsutism or infertility were slightly older 
[28.2 ± 7.6 vs. 26.5 ± 7.1 yr (P < 0.005) and 29.4 ± 5.6 vs. 27 A ± 
8.4 yr (P < 0.001), respectively] than women without these 
complaints. Patients with infertility were also more obese 
than their non-infertile counterparts (34.0 ± 8.9 vs. 32.5 ± 9.2 
kg/m 2 , P < 0.03, respectively). Alternatively, women with 
acne were younger and less overweight than those patients 
than non-acneic patients [23.6 ± 7.5 vs. 28.0 ± 7.0 yr (P < 
0.001) and 29.5 ± 8.6 vs. 33.5 ± 9.1 kg/m 2 (P < 0.001)]. 

Oligo-ovulation was present in 770 (88.2%) patients; 86.4% 
of oligo-ovulatory patients had overt menstrual dysfunction 
(12 with polymenorrhea and the remainder with oligomen- 
orrhea). One hundred five (13.6%) oligo-ovulatory patients 
had apparent eumenorrhea until evaluated more closely. Of 
hirsute women, 35 (5.3%) had apparent eumenorrhea, of 
which 14 (40%) had ovulatory dysfunction when evaluated 
by a luteal P4 level. Of the 124 patients with acne, 33.9% (or 
4.8% of the total) had acne only, without hirsutism, although 
all these patients had both hyperandrogenemia and 
oligo-ovulation. 
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Overall, 58.2% of patients included were obese (BMI, 
>30.0 kg/m 2 ), with BMIs as follows: 1.8% were underweight 
(<19.0 kg/m 2 ), 20.6% were of normal weight (19.0-24.9 kg/ 
m 2 ), 19.4% were preobese (25.0-29.9 kg/m 2 ), 18.7% were 
mildly obese (30.0-34.9 kg/m 2 ), 17.6% were moderately 
obese (35.0-39.9 kg/m 2 ), and 21.9% were severely obese 
(>40 kg/m 2 ), according to 1998 World Health Organization 
and 1999 National Center for Health Statistics /Centers for 
Disease Control and Prevention criteria (28, 29). 

Prevalence of abnormally elevated androgen measures 

Overall, 77.8% of patients included in the analysis had 
hyperandrogenemia. Total T was elevated in 325 (37.9%), 
free T in 476 (55.5%), and DHEAS in 347 (40.4%) of the 858 
subjects included. The prevalence of the different combina- 
tions of abnormal androgen levels is depicted in Table 2. 
Note that approximately one fifth of patients had no overt 
elevation in androgen levels, whereas another 10% had el- 
evations in all three androgens evaluated. 

Prevalence of the different diagnostic groups 

Among the 873 patients entered into the study, five were 
on thyroid replacement for hypothyroidism at the time of 
their initial visit, and one additional patient was diagnosed 
with hypothyroidism at her evaluation (total prevalence of 
thyroid patients was diagnosed as such during her evalua- 
tion, for a total dysfunction of 0.7%). Two patients were 
receiving bromocriptine for a previous diagnosis of hyper- 
prolactinemia, and one additional patient was diagnosed as 
such during her evaluation (for a total prevalence of hyper- 
prolactinemia of 0.3%). 

Of the 873 patients included in the study (Table 3), 
59 (6.76%) had specific identifiable disorders including 
ASNs, 21-OH-deficient CAH, 21-OH-deficient NCAH, and 
HAIRAN syndrome diagnosed in 0.23, 0.69, 2.06, and 3.78% 
of the total population, respectively. In the remainder (814 or 
93.24% of the total), disorders of FAE (i.e. of exclusion) were 
identified including PCOS, IH, and HA plus hirsutism in 
82.02, 4.47, and 6.75% of patients, respectively. 

Because ovulatory function was determined in all eum- 
enorrheic hirsute women beginning January 1995, the unbi- 



TABLE . 2. Prevalence of abnormally elevated androgen measures 
in 858° patients with androgen excess 



Abnormal androgen(s) 6 


Number 


Prevalence (%) 


None 


190 


22.2 


Total T only 


20 


2.4 


Free T only 


125 


14.6 


DHEAS only 


146 


17.0 


Total T and free T 


174 


20.3 


Total T and DHEAS 


24 


2.8 


Free T and DHEAS 


69 


8.0 


Total T, free T, and DHEAS 


109 


12.7 


Total 


858 


100.0 



° Note that 15 subjects did not have all three androgen results 
(total and free T and DHEAS) available and, thus, were not included 
in this analysis. 

* Abnormal androgen levels were defined as an androgen value 
above the 95th percentile of 98 healthy control women [Le. a total T ^ 
2.94 nmol/liter (88 ng/dl), free T > 0.026 nmol/liter (0.66 ng/dl), or 
DHEAS 2: 6.64 jimol/liter (2750 ng/ml)], as reported previously (1). 
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TABLE 3. Differential diagnosis of 873 patients evaluated for 
androgen excess 



Diagnosis 


Total no. 


% Prevalence 


% Unbiased 
prevalence 0 


Specific disorders 








ASN 


2 


0.23 




CAH 


6 


0.69 




NCAH 


18 


2.06 


1.60 


HAIRAN 


33 


3.78 


3.12 


Disorders of exclusion 








PCOS 


716 


82.02 




IH 


39 


4.47 


4.68 


HA + hirsutism 


59 


6,75 




Total 


873 


100.00 





° The unbiased prevalence of IH was calculated using the popu- 
lation screened after January 1995, because ovulatory function was 
determined in all eumenorrheic hirsute women beginning with that 
date. The unbiased prevalence of HAIRAN syndrome was calculated 
using the populatibn screened after June 1996, when the fasting 
insulin levels were routinely obtained. The unbiased prevalence of 
NCAH was based on the number of unrelated patients diagnosed 
during the study period and who had not been identified a priori by 
either a previous diagnosis or by having affected relatives. 

ased incidence of IH was calculated as 4.68% (23 of 491 
subjects studied). After June 1996, the fasting INS levels were 
routinely obtained, resulting in an unbiased incidence of 
HAIRAN syndrome of 3.12% (12 of 385). Finally, because 
three of the NCAH patients were diagnosed because they 
were relatives of a patient already diagnosed with this dis- 
order and one additional patient was diagnosed before the 
study, the unbiased incidence of NCAH was 1.60% (14 of 
873). Only two of our NCAH patients did not have a de- 
tectable mutation on genotyping, and both of these women 
had ACTH-stimulated 17-HP levels greater than 20 ng/ml, 
suggesting that their diagnosis was not in question. 

Comparison of diagnostic groups 

In comparing clinical and laboratory features between di- 
agnostic groups, we again excluded patients with CAH and 
ASN because of their small numbers. We observed (Table 4) 
that there was significant racial difference between the dif- 
ferent diagnoses, with NCAH patients being exclusively of 
the white race, whereas a greater proportion of HAIRAN 
patients were black (36.4%). 

HAIRAN syndrome patients were younger, and women 
with IH were older than all others. HAIRAN syndrome pa- 
tients also had a greater body mass and WHR than all others. 
Although PCOS patients had a lower mean BMI and WHR 
than HAIRAN patients, they were, on average, larger and 
had a higher WHRs than the other diagnostic groups. 

There was no difference between the groups regarding the 
prevalence of acne, whereas the differences observed in the 
frequency of oligo-ovulation and hirsutism were primarily 
related to the differences in the criteria for defining a diag- 
nostic group {e.g. by definition, patients with IH and HA plus 
hirsutism have hirsutism with normo-ovulation). PCOS pa- 
tients had the highest and HA plus hirsutism patients the 
lowest prevalence of infertility, respectively. 

When comparing measures of INS action, including fast- 
ing glucose, INS, HOMA-IR, and HOMA-%/3-cell, we ana- 
lyzed only those values obtained after April 1996, when these 
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TABLE 4, Characteristics of the study population by diagnostic group 
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Diagnostic group 



variaDie 


rOUo in — ( Lo) 


riAiKAIN (n — oo) 


NCAH (n = 18) 


in in — o*j) 


HA + hirsutism (n - 59) 


Age (yr) 


27.67 ± 7.26 


23.10 ± 7.96 a 


28.28 


± 10.65 


32.58 ± 9.29° 


97 71 
it i . 1 1 


± 8.99 


BMI (kg/m 2 ) 


33.39 ± 9.26 a 


38.69 ± 9.21° 


29.00 


± 6.32 


28.65 ± 7.13 


28.65 


± 6.72 


WHR 


0.83 ± 0.09^ 


0.87 ± 0.092*' 


0.83 


± 0.069 


0.81 ± 0.11 


0.79 


± 0.08 


Total T (ng/dl)* 


90.28 ± 55.67 c 


111.91 ± 13GJ2 b ' c ' f 


116.24 


± 64.61 fc e - / 


57.40 ± 13.84 6 ' cd 


1 o 


1Q fi1 c « rf 

— 117. Ol 


SHBG (nmol/liter)* 


182 ± 7l c ' d - e 


155 ± 41° 


235 


± 15 b ' c - r 


243 ± 91 6 ' c * c 


186 


± 68 d,c 


Free T (ng/dl)* 


0.92 ± 0.57' 


0.95 ± 0.54 


1.27 


± 0.55 6 ' 


0.47 ± 0.15° 


ft 7*i 


± 0.27 6,<J '^ 


DHEAS (ng/ml)* 


2350.8 ± 1289.6 a 


1897.6 ± 1215.9 W 


3485.6 


± 1691.2 6 ' c - e 


368.46 ± 722.18 w 


2910.8 


± 1204.9 fe,c ' e 


mF-G score 


8.2 ± 4.9 c/r 


10.1 ± 5.22 6 


7.78 


± 3.98 


9.0 ± 3.6 . 


9.6 


± 3.2 6 


Fasting plasma INS 


21.1 ± l,Q c eJ 


88.1 ± 4.5 a 


9.5 


± 8.4° 


12.0 ± 4.5 6 c 


12.9 


± 3.6 6 ' c 


/ T T / I \h 
















Plasma GLU (mg/dl)* 


89.5 ± 1.2 


92.6 ± 5.4 


91.5 


± 10.1 


89.4 ± 5.4 


86.5 


±4.2 


HOMA-IR (fiU-mol/ml)* 


4.70 ± 0.37 


22.26 ± 1.60° 


2.22 


±3.39 


2.78 ± 2.53 


2.80 


±3.06 


HOMA-%j3-cell CmU/mol)* 


335.3 ± 369.3 


193.7 ± 1012.3 


113.0 


± 133.3 


173.7 ± 216.2 


213.9 


± 158.2 


Race 
















White (%) 


83.2 C 


63,6 W 


100.0 C 


79.0 




84.7 C 


Black (%) 


14.4 r 


36A bdf 




0.0 C 


18.4 




10,2 C 


Other (%) 


2.4 


0.0 




0.0 


2.6 




5.0 


Acne (%) 


14.5 


6.1 




22.2 


17.9 




11.9 


Oligo-ovulatory (%)' 


lOO.O**-' 


100.0^ 




88.9 b - e ' r 


0 Qb.C.d 




0.0 6 ' c#d 


Infertile (%) 


32.7*^ 


13.8" 




22.2 


23.1>" 




6.8 6 - fi 


Hirsute (%) iJ 


72.2 C ' / * 


84.8 e - r 




72.2 ef 


100.0 A * cd 


100.0 6c * rf 


Obesity (%)* 


60.0 CC ^ 


90.0 bd - ef 




50.0 C 


35.9* c 




37.3*- c 



The data are mean ± SD. Variables: total and free T [conversion factor to SI units (nmol/liter) is 0.03467]; DHEAS [conversion factor to SI 
units (juunol/liter) is 0.002714]; INS [conversion factor to SI units (pmol/liter) is 7.175]; GLU [conversion factor to SI units (mmol/liter) is 0.05551], 
° Significantly different from all other diagnostic groups. 
6 Significantly different from PCOS. 
c Significantly different from HAIRAN. 
d Significantly different from NCAH. 
e Significantly different from IH. 
r Significantly different from HA + hirsutism. 

* IH patients were excluded from the analysis of androgen measures. 

h The values of INS, GLUC, HOMA-IR, and HOMA-%/3-cell depicted and compared are for the period of April 1996 through June 2002, when 
these measures were obtained routinely. 

*' By definition, patients with IH or with HA + hirsutism are always hirsute and normo-ovulatory. 
J Hirsutism is defined as a mF-G score of 6 or greater. 

* Obesity is defined as 30 kg/m 2 or greater (28, 29). 



measures were obtained routinely. As expected, HAIRAN 
patients had higher fasting INS and HOMA-IR values than 
all other subjects (Table 4). Patients with PCOS also had 
higher INS levels than women with IH or HA plus hirsutism. 
There were no differences in mean fasting GLU levels or 
HOMA-%]3-cell values between groups. 

Long-term response to hormonal suppressive therapy 

Subjects. To assess the long-term response of androgen excess 
patients to hormonal suppressive therapy, we reviewed 736 
consecutive patient charts. Of these, we excluded 102 pa- 
tients (13.8% of the total) because we were unable to locate 
their full medical record. Of the remaining patients, 133 
(18.1% of total or 21.0% of those patients in whom the chart 
was found) were excluded because they sought fertility ther- 
apy only. Another 244 (33.1% of total or 38.5% of those 
patients in whom the chart was found) patients were further 
excluded from analysis due to inadequate duration of 
follow-up (n = 172 or 23.4% of total), or for noncompliance 
with either treatment or follow-up (n = 72 or 9.8% of total). 
Patients who had inadequate duration of follow-up or were 
not compliant with either treatment or follow-up were more 
likely to be black but not different in age, BMI, or initial mF-G 
score than those subjects who were compliant with therapy 



and follow-up (Table 5). Overall, among patients with a chart 
available, 31.7% of black patients were compliant and had 
adequate follow-up compared with 54.1% of white patients. 

Of the 257 patients who met the inclusion criteria for 
compliance and follow-up, 246 (95.7%) presented with an 
initial (although not sole) complaint of excess body and /or 
facial hair growth, 178 (69.3%) with some form of menstrual 
dysfunction, 42 (16.3%) with acne, 12 (4.7%) with hair loss, 
and 22 (8.6%) with weight gain or obesity. It should be noted 
that the patient's initial complaint did not necessarily coin- 
cide with findings on physical examination. For example, of 
246 patients presenting with a complaint of hirsutism, 38 
(15.4%) had a mF-G score 5 or less, whereas five women with 
a FG score of 6 or greater did not initially note excess hair 
growth. 

Two hundred (77.8%) of the patients included in this anal- 
ysis had PCOS, 27 (10.5%) had hirsutism and hyperandro- 
genemia but normal ovulatory function, 12 (4.7%) had 
HAIRAN syndrome, 10 (3.9%) had NCAH, and eight (3.1) 
had IH. Overall, the small numbers of subjects in some of 
these categories precluded a separate analysis of therapeutic 
success rates by diagnostic group. The initial treatment pre- 
scribed was OC plus SPA in 181 (70.4%) patients, OC only in 
22 (8.6%) patients, SPA only in 30 (11.7%) patients, and other 
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TABLE 5. Characteristics of androgen excess patients who were included vs. those who were excluded due to an inadequate duration of 
follow-up (F/U), noncompliance with either treatment or follow-up, or inability to retrieve full medical record 



Characteristics 


Included 0 (n - 257) 


<6 months F/U 6 (n = 172) 


Noncorapliant 6 (n = 72) 


Chart not found 6 (n = 102) 


Race r 










White 


230 (90%) 


140 (81%) 


55 (77%) 


84 (82.3%) 


Black 


20 (8%) 


27 (16%) 


16 (22%) 


17 (16.7%) 


Other 


7 (3%) 


5 (3%) 


1 (1%) 


1 (1%) 


Age ( yr) 










Mean ± SD 


27.6 ± 8,9 


28.2 ± 8,1 


26.2 ± 8.2 


28.2 ± 7.5 


Range 


13-48 


13-55 


14-47 


12-51 


BMI (kg/m 2 ) 










Mean ± SD 


31.8 ± 9.5 


33.8 ± 9.2 


33.3 ± 9.7 


31.6 ± 8.4 


Range 


17.7-65.0 


16.5-67.4 


17.1-65.5 


18.4-54.8 


Initial mF-G score 










Mean ± SD 


9.3 ± 4.7 


8.9 ± 4.9 


9.6 ± 4.3 


8.1 ± 5.0 


Range 


0-27 


0-31 


2-18 


0-24 



° Patients who were compliant with both treatment and F/U were included in the analysis. 

b Patients with inadequate F/U, noncompliance with either treatment or F/U, in whom we were unable to retrieve their full medical record, 
were excluded from the analysis. 

c Significant difference in race between the groups (x 2 = 15.743; P < 0.005). 



TABLE 6. Self-reported treatment outcome in hyperandrogenic 
patients receiving hormonal suppression and who were compliant 
with therapy 



Clinical feature 


Treatment 
outcome 


No. of patient 
responses 


Overall % 
response 


Excess body and/or 


Better 


207 


85.9 


hair growth 










Same 


30 


12.4 




Worse 


4 


1.7 


Menstrual 


Better 


146 


79.8 


dysfunction 










Same 


26 


14.2 




Worse 


11 


6.0 


Acne 


Better 


29 


80.6 




Same 


6 


16.7 




Worse 


1 


2.8 


Scalp hair loss 


Better 


3 


33.3 




Same 


2 


22.2 




Worse 


4 


44.0 



medications or combinations in 24 (9.3%) patients. The mean 
follow-up period was 33.5 ± 30.1 months (range, 6-129). 

Outcome. Self-reported treatment outcomes are depicted in 
Table 6, The success rate (i.e. patients reporting that they were 
better) in women who were compliant with therapy ranged 
from 80-86% for hirsutism, menstrual dysfunction, and 
acne. The success rate of treating alopecia, albeit with a small 
number of patients with this complaint, was significantly 
lower at 33%. There were no differences in mean BMI or age 
between patients experiencing improvement in either hir- 
sutism or menstrual dysfunction. A meaningful analysis 
could not be performed for acne or alopecia due to the small 
number of subjects in each group. 

The initial and follow-up mF-G scores were available in 
176 subjects of a total of 213 patients presenting with hir- 
sutism (i.e. mF-G score > 6). The mean initial mF-G score was 
10.5 ± 4.1, and the last recorded mF-G score was 4.7 ± 3.4, 
with a mean net change in mF-G score of 5.9 ± 4.1 in 3.4 ± 
2.7 yr. The mean rate of decrease in the mF-G score was 
-2.8 ± 3.3 per year. Overall, 65.9% of patients had a mF-G 
score of 5 or less at their last recorded follow-up evaluation. 
If only those patients who received combination suppressive 
therapy (OC plus SPA) were analyzed, the mean net change 



TABLE 7. Prevalence of side effects among hyperandrogenic 
patients receiving suppressive therapy and who were compliant 
with their treatment 





No. of 


% of 


Side effect 


patients 


patients 


with side 


with side 




effect 


effect 


Irregular vaginal bleeding or BTB 


41 


16.1 


Nausea or vomiting 


33 


13.0 


Headache or migraines 


32 


12.6 


Depression or mood changes 


30 


11.8 


Vaginal infections/irritation 


13 


5.1 


Fatigue 


13 


5.1 


Muscle cramps 


8 


3.1 


Decreased libido 


7 


2.8 


Dyspepsia or heartburn 


7 


2,8 


Abdominal pain 


6 


2.4 


Weight gain 


6 


2.4 


Hot flashes 


5 


2.0 


Insomnia 


4 


1.6 


Alopecia 


3 


1.2 


Weight loss 


2 


0.8 


Acne 


2 


0.8 


Night sweats 


2 


0.85 


Dry eyes 


2 


0.8 


Bloating 


2 


0.8 


Heart palpitations 


2 


0.8 


No symptoms 


93 


36.6 



in mF-G score was -6.2 ± 4.1, with a self reported rate of 
improvement of 89.0%. One hundred one (47.4% of the total) 
patients with a mF-G score or 6 or greater reported using 
electrology. Patients who used eiectrology concomitantly 
with hormonal suppression had a greater net change in their 
mF-G score compared with those hirsute women not using 
electrology (3.0 ± 3.0 vs. -2.6 ± 2.4; P < 0.05). 

Side effects. The most common side effects of hormonal sup- 
pressive therapy reported were headache, nausea, and ir- 
regular vaginal bleeding (Table 7), Only 93 (36.6%) subjects 
assessed did not report any side effects. Of patients having 
BTB, 38.2% stated that the hormonal therapy did not improve 
or actually worsened their menstrual dysfunction compared 
with 16.1% of patients without BTB (P < 0.005). 
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Discussion 

We have reported on our experience evaluating over 800 
patients with androgen excess. Of the patients included in the 
study, two thirds were 30 yr old or younger, with 20% of 
patients under the age of 20 yr at the time of their initial visit. 
Assuming that most androgen excess disorders begin to be- 
come clinically evident peripubertally, these data suggest 
that up to 80% of patients are not recognized, evaluated, and 
treated in a timely fashion. Approximately 60% of patients 
studied were obese (BMI, S30.0 kg/m 2 ), and 20% were se- 
verely obese (BMI, >40 kg/m 2 ). Hence, the prevalence of 
obesity among androgen excess patients in this study is at 
least 2-fold higher than that of the general population. For 
example, data from the National Health and Nutrition Ex- 
amination Survey project indicate that the prevalence of obe- 
sity in women 20-34 yr of age of all ethnicities /races was 
25.8% in the 1999-2000 survey (30, 31). 

In our population, clinically evident androgen excess, 
namely hirsutism, was observed in 75% of subjects. We 
should note that the prevalence of hirsutism would have 
been lower if the proportion of patients of Asian descent 
(<1% in the studied population) were greater, because these 
individuals tend to have lower degrees of excess hair growth 
(5). An improvement in hirsutism with combination hor- 
monal suppression (an OC and SPN) was reported by ap- 
proximately 90% of patients, and approximately 65% of these 
women had minimal excess hair growth (i.e. a mF-G score 
<5) at their last recorded follow-up examination after a mean 
treatment period of approximately 3.5 yr. This success rate 
is consistent with previous reports evaluating antiandrogen 
therapy, alone or in combination with OCs, for the treatment 
of hirsutism (32, 33). Our data also indicated that patients 
who used electrology in addition to hormonal suppression 
had a greater decrease in the hair growth than those who did 
not, suggesting that this hair destruction technique should be 
encouraged as an adjuvant in the management of the hirsute 
woman. 

Acne was the sole complaint or finding in 4.8% of hy- 
perandrogenic patients evaluated, all of whom were both 
hyperandrogenemic and oligo-ovulatory. We and others 
have noted previously that the prevalence (34-36) of hy- 
perandrogenemia among acneic patients is significant and 
that this clinical feature may be used as a marker for andro- 
gen excess, regardless of the patient's age. However, we 
should note that our study design does not allow us to 
estimate the prevalence of androgen excess among acneic 
patients, because we . included only hyperandrogenic pa- 
tients into our study. The self-reported improvement in acne 
with hormonal suppression was approximately 80%. Similar 
success rates have been reported in randomized trials of 
acneic women treated with an OCP only (37-40). Although 
spironolactone has also been found to have a positive effect 
on acne, a systematic review of published data could not 
determine the effectiveness of treatment and its value in 
clinical practice due to the small sample populations in- 
volved in previous trials (32). 

Scalp hair loss was a complaint in 14 (4.0%) patients. All 
but one had ovulatory dysfunction, and all were also hirsute. 
Our data are consistent with that of others, indicating that the 
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prevalence of androgen excess is high in alopetic women, 
particularly in those with ovulatory dysfunction (41). As for 
acne, our study design does not allow us to estimate the 
prevalence of androgen excess among alopetic patients, in 
general. Hormonal suppression, even in these hyperandro- 
genic patients, had limited effect on their alopecia, although 
the response of hirsutism and menstrual dysfunction in these 
patients was overall similar to that of the study population 
as a whole. The poor response of androgenic alopecia to 
androgen blockade and /or hormonal suppression is consis- 
tent with the results of a recent randomized trial (42). 

Almost 90% of our patients had ovulatory dysfunction, 
approximately 85% with obvious oligomenorrhea and ap- 
proximately 1.5% with polymenorrhea (cycles less than 26 d 
in length). Almost 15% of oligo-ovulatory patients had ap- 
parent eumenorrhea until evaluated more closely using a 
luteal (d 22-24) phase P4 level with or without basal body 
temperature monitoring. Of hirsute eumenorrheic women 
(5.3% of the total hirsute population), 40% were actually 
found to have ovulatory dysfunction when evaluated more 
closely. These data confirm the high prevalence of ovulatory 
dysfunction among hirsute women with regular menstrual 
cycles reported by others and ourselves (23, 43). Thus, direct 
assessment of ovulatory function should be a routine part of 
the evaluation of the hirsute woman with apparently regular 
episodes of vaginal bleeding. The success of hormonal ther- 
apy in subjectively improving menstrual dysfunction was 
approximately 80%. The high rate of BTB (-16%) may have 
precluded a higher success rate in the treatment of this clin- 
ical feature, because patients who experienced BTB were 
twice as likely as those who did not to state that hormonal 
therapy did not improve their menstrual dysfunction (38.2 
vs. 16.1%). 

Overall, approximately 78% of patients included in this 
study had detectable hyperandrogenemia, indicating that 
approximately 30% of patients did not have overt evidence 
of hyperandrogenemia, consistent with previous reports (1, 
3, 44, 45). Free T was the most prevalent marker, solely 
elevated in 15% of patients and in combination with abnor- 
malities of either total T or DHEAS in another approximate 
40%. DHEAS also proved to be useful because it was the only 
laboratory abnormality detected in 17% of patients studied. 
However, total T was much less useful, the only androgen to 
be elevated in only 2% of subjects. 

Because approximately 25% of our patients are nonhirsute 
and primarily diagnosed by having laboratory evidence of 
hyperandrogenemia, our data indicate that the measurement 
of free T and DHEAS are indicated in the evaluation of the 
nonhirsute woman suspected of suffering from androgen 
excess. Nonetheless, a few caveats are evident. First, the 
diagnostic value of free T is closely related to the assay 
method used. Recommended methods for the assessment of 
free T includes equilibrium dialysis (46, 47), calculation of 
free T from the measurement of SHBG and total T (48, 49), 
or ammonium sulfate precipitation (50). In general, current 
direct assays for the measurement of free T have limited 
value, particularly in the evaluation of the hyperandrogenic 
woman (46, 47). Second, the impact of the age-related decline 
in DHEAS levels, suggesting the need for age-related nor- 
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mative values, on the diagnostic value of this metabolite 
remains unclear (51). 

In addition to the evaluation of the patient with uncertain 
androgenization, some investigators feel that the measure- 
ment of total T and DHEAS has some value in the detection 
of ASNs (52). However, more recent data suggest that the 
best predictor of these neoplasms is the clinical presentation, 
because androgen measures may be misleading and have a 
low positive predictive value (16, 53-55). In the population 
studied, both of our patients with ovarian androgen-secret- 
ing tumors had total T levels greater than 6.94 nmol/liter (200 
ng/dl) when diagnosed. One patient was 65 yr old and had 
a rapid and sudden onset of severe hyperandrogenic signs 
and symptoms, suggesting the presence of an ASN, regard- 
less of androgen levels (18). However, the second patient was 
originally mistaken for having 21-OH-deficient NCAH, and 
it was only after the circulating total T levels rose persistently 
to levels above 6.94 nmol/liter despite treatment with com- 
bination therapy that the correct diagnosis was suspected. 
Hence, although the positive predictive value of a total T 
level greater than 6.94 nmol/liter (200 ng/dl) is approxi- 
mately 10% (18), this androgen may still have some value in 
identifying less clinically obvious patients with an ASN. 

Thyroid dysfunction was found to be a relatively uncom- 
mon abnormality in our hyperandrogenic patients. Ferriman 
and Purdie (56) reported that none of their 467 hirsute 
women had a thyroid abnormality, whereas we diagnosed 
only one of our 873 patients as having hypothyroidism (al- 
though five others were already on thyroid replacement be- 
fore their evaluation). This prevalence is similar or actually 
less than that reported by other investigators in the general 
population of women of similar age (57-61). Hyperpro- 
lactinemia was also rare, diagnosed in only one (0.3%) of our 
patients during their evaluation, a prevalence similar to that 
found by other investigators (6, 55, 62-64). 

Approximately 7% of our androgen excess patients suf- 
fered from specific and definable disorders of inclusion, in- 
cluding ASNs (0.2%), CAH (0.7%), 21-OH-deficient NCAH 
(1.6%), and the HAIRAN syndrome (3.1%). Whereas ASNs 
are diagnosed by pathological examination and 21-OH- 
deficient NCAH by CYP21 genotyping, the diagnosis of 
HAIRAN syndrome is less clear. Although previous inves- 
tigators have established diagnostic levels of basal or GLU- 
stimulated INS levels that are 3- to 5-fold the upper normal 
limits and some patients may have overt lipodystrophy (20, 
21), it may be argued that many of these women simply 
represent a more metabolically affected group for PCOS. 
Only more intensive family and genetic studies will elucidate 
the true etiology and diagnostic criteria for this syndrome. 

The remainder of patients evaluated had FAE or disorders 
of exclusion, including PCOS (82%), IH (4.7%), and hirsute 
and hyperandrogenemic patients with normal ovulatory 
function (6.8%). None of our patients were found to have 
Cushing's syndrome or adrenal neoplasms. Also, although a 
few patients were seen with minimal clinical evidence of 
androgen excess (e.g. acne and /or seborrhea) after the ad- 
ministration of an androgenic drug, none were found to have 
sufficiently significant clinical and /or biochemical hyperan- 
drogenism to meet the entry criteria into this study. 
. Notwithstanding significant variations in how the disor- 
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ders were defined and the ethnicity of the populations stud- 
ied, our prevalences are generally similar to those of other 
investigators (6, 55, 62-64). In these studies, the prevalence 
of ASNs ranged from 0.6-2.1% (ovarian ASNs, 0.3-1%; ad- 
renal ASNs, 0-2.1%), NCAH from 1-3%, drug-related hy- 
perandrogenism from 0-1%, and Cushings syndrome from 
0-1%. The widest variations were found in the prevalences 
of PCOS (37.8-78%) and IH (15-38.7%), although the relative 
degrees were in general agreement with those of our study. 
We should note that previous reports differ from the present 
study by using criteria for the disorders diagnosed that was 
ambiguous or is currently outdated, and for not adjusting the 
prevalence rates reported for the potential bias inherent to 
the screening methods used. 

Comparing the various diagnostic groups, we noted that 
HAIRAN syndrome patients were younger, more obese, 
more likely to be black, more hirsute, more INS resistant, and 
had the lowest SHBG levels compared with the other phe- 
notypes. Patients with PCOS had the second highest degree 
of obesity and INS resistance and were the most likely to be 
infertile. Patients with IH were, on average, older and, as 
expected, had lower total T, free T, and DHEAS levels. 
Women with NCAH had the highest mean DHEAS and total 
and free T levels and were exclusively of the white race. 

Approximately 7% of androgen excess patients had hir- 
sutism and hyperandrogenemia, but with normal ovulation, 
and represent a diagnostic conundrum. Compared with 
PCOS patients, these women had a similar mean age but 
lower mean BMI, were less likely to be infertile, reflecting 
their normal ovulatory status; and had lower free T levels 
despite being more hirsute. They did, however, have higher 
DHEAS levels than PCOS patients. On average, these pa- 
tients were slightly younger and had higher DHEAS and free 
T levels compared with IH patients. However, they had 
similar mean BMI and obesity rates, measures of INS action, 
and mF-G scores as patients with IH. These data suggest that 
patients with hirsutism and hyperandrogenemia, but with 
normal ovulation, may more closely resemble IH than PCOS 
patients and may represent individuals with predominantly 
adrenal androgen excess. Whether this clinical presentation 
represents a stable phenotype or is a transition form to PCOS 
remains to be determined. 

Although we have discussed the overall results of hor- 
monal therapy above, a few facts are worth highlighting. 
First, of the 501 patients who were seeking hormonal sup- 
pression of their androgenic symptoms and for whom the 
chart was located, approximately 50% had either less than 6 
months of follow-up or were noncompliant with either treat- 
ment or follow-up. These patients were more likely to be 
black but did not differ in the apparent severity of their 
disorder. Impacting on compliance may be the high rate of 
side effects with therapy. Although few, if any, serious side 
effects were observed, only 37% of subjects studied did not 
report any side effects. Notably, in Alabama, few other prac- 
titioner options were available for patients desiring man- 
agement of their hyperandrogenism. These data suggest that 
we must identify strategies for improving the level of edu- 
cation and compliance of these women. 

The present report has a number of strengths: the large 
number of patients included; the fact that baseline features 
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were obtained and maintained prospectively; the careful 
phenotyping implemented, including early recognition of 
the possibility of HAIRAN syndrome and NCAH and use of 
a standardized mF-G hirsutism scoring system; the recog- 
nition that diagnostic criteria evolved throughout the study; 
the uniformity of the assessment as performed by one in- 
vestigator (R.A.); and the recognition that the investigator 
served a large catchment area. However, the study also has 
a few notable limitations: the fact that the protocol and def- 
initions were modified over time, consistent with changes in 
our understanding of the disorders under consideration; the 
retrospective nature of our assessment of the presence of 
alopecia and of the therapeutic outcomes; and the increasing 
reputation of the examiner in this area of study throughout 
the time of the study, which may have increased the pro- 
portion of patients seen with rare causes of androgen excess. 
Nonetheless, it is unlikely that these limitations would sig- 
nificantly affect the results of the study. 

In conclusion, in this large study of consecutive patients, 
PCOS was observed in approximately 80%, IH in approxi- 
mately 5%, HAIRAN syndrome in approximately 3%, and 
NCAH in approximately 1.5% of patients, and ASNs were 
observed in approximately one of 500 androgen excess pa- 
tients. The etiology of 7% of women who demonstrated hir- 
sutism and hyperandrogenemia, but normal ovulation, re- 
mains to be determined. Over 80% of androgen excess 
patients compliant with their therapy experienced an im- 
provement in hirsutism, menstrual dysfunction, and acne 
with a suppressive hormonal regimen, although only 36% 
were free of side effects. Alternatively, hair loss, although an 
infrequent complaint in our population, failed to improve in 
almost 70% of patients affected. Notably, almost 50% of pa- 
tients did not appear to be compliant with therapy. These 
data indicate that PCOS is the most frequent cause of an- 
drogen excess, with all other remaining causes affecting a 
minority of patients, and that, with the exception of alopecia, 
therapeutic success is high in patients who are compliant 
with hormonal therapy. 
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The Development and Causes of Cancer 

The fundamental abnormality resulting in the development of cancer is the 
continual unregulated proliferation of cancer cells. Rather than responding 
appropriately to the signals that control normal cell behavior, cancer cells 
grow and divide in an uncontrolled manner, invading normal tissues and 
organs and eventually spreading throughout the body. The generalized loss of 
growth control exhibited by cancer cells is the net result of accumulated 
abnormalities in multiple cell regulatory systems and is reflected in several 
aspects of cell behavior that distinguish cancer cells from their normal 
counterparts. 

Types of Cancer 

Cancer can result from abnormal proliferation of any of the different kinds of 
cells in the body, so there are more than a hundred distinct types of cancer, 
which can vary substantially in their behavior and response to treatment. The 
most important issue in cancer pathology is the distinction between benign 
and malignant tumors (Figure 15.1 ). A tumor is any abnormal proliferation of 
cells, which may be either benign or malignant. A benign tumor , such as a 
common skin wart, remains confined to its original location, neither invading 
surrounding normal tissue nor spreading to distant body sites. A malignant 
tumor , however, is capable of both invading surrounding normal tissue and 
spreading throughout the body via the circulatory or lymphatic systems 
( metastasis) . Only malignant tumors are properly referred to as cancers, and 
it is their ability to invade and metastasize that makes cancer so dangerous. 
Whereas benign tumors can usually be removed surgically, the spread of 
malignant tumors to distant body sites frequently makes them resistant to 
such localized treatment. 

Both benign and malignant tumors are classified according to the type of cell 
from which they arise. Most cancers fall into one of three main groups: 
carcinomas, sarcomas, and leukemias or lymphomas. Carcinomas , which 
include approximately 90% of human cancers, are malignancies of epithelial 
cells. Sarcomas , which are rare in humans, are solid tumors of connective 
tissues, such as muscle, bone, cartilage, and fibrous tissue. Leukemias and 
lymphomas , which account for approximately 8% of human malignancies, 
arise from the blood-forming cells and from cells of the immune system, 
respectively. Tumors are further classified according to tissue of origin (e.g., 
lung or breast carcinomas) and the type of cell involved. For example, 
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fibrosarcomas arise from fibroblasts, and erythroid leukemias from 
precursors of erythrocytes (red blood cells). 

Although there are many kinds of cancer, only a few occur frequently ( Table 
15.1 ). More than a million cases of cancer are diagnosed annually in the 
United States, and more than 500,000 Americans die of cancer each year. 
Cancers of 10 different body sites account for more than 75% of this total 
cancer incidence. The four most common cancers, accounting for more than 
half of all cancer cases, are those of the breast, prostate, lung, and 
colon/rectum. Lung cancer, by far the most lethal, is responsible for nearly 
30% of all cancer deaths. 

The Development of Cancer 

One of the fundamental features of cancer is tumor clonality, the 
development of tumors from single cells that begin to proliferate abnormally. 
The single-cell origin of many tumors has been demonstrated by analysis of 
X chromosome inactivation ( Figure 15.2 ). As discussed in Chapter 8 , one 
member of the X chromosome pair is inactivated by being converted to 
heterochromatin in female cells. X inactivation occurs randomly during 
embryonic development, so one X chromosome is inactivated in some cells, 
while the other X chromosome is inactivated in other cells. Thus, if a female 
is heterozygous for an X chromosome gene, different alleles will be 
expressed in different cells. Normal tissues are composed of mixtures of cells 
with different inactive X chromosomes, so expression of both alleles is 
detected in normal tissues of heterozygous females. In contrast, tumor tissues 
generally express only one allele of a heterozygous X chromosome gene. The 
implication is that all of the cells constituting such a tumor were derived 
from a single cell of origin, in which the pattern of X inactivation was fixed 
before the tumor began to develop. 

The clonal origin of tumors does not, however, imply that the original 
progenitor cell that gives rise to a tumor has initially acquired all of the 
characteristics of a cancer cell. On the contrary, the development of cancer is 
a multistep process in which cells gradually become malignant through a 
progressive series of alterations. One indication of the multistep development 
of cancer is that most cancers develop late in life. The incidence of colon 
cancer, for example, increases more than tenfold between the ages of 30 and 
50, and another tenfold between 50 and 70 ( Figure 15.3 ). Such a dramatic 
increase of cancer incidence with age suggests that most cancers develop as a 
consequence of multiple abnormalities, which accumulate over periods of 
many years. 

At the cellular level, the development of cancer is viewed as a multistep 
process involving mutation and selection for cells with progressively 
increasing capacity for proliferation, survival, invasion, and metastasis 
( Figure 15.4) . The first step in the process, tumor initiation, is thought to be 
the result of a genetic alteration leading to abnormal proliferation of a single 
cell. Cell proliferation then leads to the outgrowth of a population of clonally 
derived tumor cells. Tumor progression continues as additional mutations 
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occur within cells of the tumor population. Some of these mutations confer a 
selective advantage to the cell, such as more rapid growth, and the 
descendants of a cell bearing such a mutation will consequently become 
dominant within the tumor population. The process is called clonal selection, 
since a new clone of tumor cells has evolved on the basis of its increased 
growth rate or other properties (such as survival, invasion, or metastasis) that 
confer a selective advantage. Clonal selection continues throughout tumor 
development, so tumors continuously become more rapid-growing and 
increasingly malignant. 

Studies of colon carcinomas have provided a clear example of tumor 
progression during the development of a common human malignancy ( Figure 
15.5 ). The earliest stage in tumor development is increased proliferation of 
colon epithelial cells. One of the cells within this proliferative cell population 
is then thought to give rise to a small benign neoplasm (an adenoma or 
polyp ). Further rounds of clonal selection lead to the growth of adenomas of 
increasing size and proliferative potential. Malignant carcinomas then arise 
from the benign adenomas, indicated by invasion of the tumor cells through 
the basal lamina into underlying connective tissue. The cancer cells then 
continue to proliferate and spread through the connective tissues of the colon 
wall. Eventually the cancer cells penetrate the wall of the colon and invade 
other abdominal organs, such as the bladder or small intestine. In addition, 
the cancer cells invade blood and lymphatic vessels, allowing them to 
metastasize throughout the body. + top 

Causes of Cancer 

Substances that cause cancer, called carcinogens , have been identified both 
by studies in experimental animals and by epidemiological analysis of cancer 
frequencies in human populations (e.g., the high incidence of lung cancer 
among cigarette smokers). Since the development of malignancy is a 
complex multistep process, many factors may affect the likelihood that 
cancer will develop, and it is overly simplistic to speak of single causes of 
most cancers. Nonetheless, many agents, including radiation, chemicals, and 
viruses, have been found to induce cancer in both experimental animals and 
humans. 

Radiation and many chemical carcinogens (Figure 15.6) act by damaging 
DNA and inducing mutations. These carcinogens are generally referred to as 
initiating agents, since the induction of mutations in key target genes is 
thought to be the initial event leading to cancer development. Some of the 
initiating agents that contribute to human cancers include solar ultraviolet 
radiation (the major cause of skin cancer), carcinogenic chemicals in tobacco 
smoke, and aflatoxin (a potent liver carcinogen produced by some molds that 
contaminate improperly stored supplies of peanuts and other grains). The 
carcinogens in tobacco smoke (including benzo(a)pyrene, 
dimethylnitrosamine, and nickel compounds) are the major identified causes 
of human cancer. Smoking is the undisputed cause of 80 to 90% of lung 
cancers, as well as being implicated in cancers of the oral cavity, pharynx, 
larynx, esophagus, and other sites. In total, it is estimated that smoking is 
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responsible for nearly one-third of all cancer deaths— an impressive toll for a 
single carcinogenic agent. 

Other carcinogens contribute to cancer development by stimulating cell 
proliferation, rather than by inducing mutations. Such compounds are 
referred to as tumor promoters , since the increased cell division they induce 
is required for the outgrowth of a proliferative cell population during early 
stages of tumor development. The phorbol esters that stimulate cell 
proliferation by activating protein kinase C (see Figure 13.26 ) are classic 
examples. Their activity was defined by studies of chemical induction of skin 
tumors in mice ( Figure 15.7 ). Tumorigenesis in this system can be initiated 
by a single treatment with a mutagenic carcinogen. Tumors do not develop, 
however, unless the mice are subsequently treated with a tumor promoter 
(usually a phorbol ester) to stimulate proliferation of the mutated cells. 

Hormones, particularly estrogens, are important as tumor promoters in the 
development of some human cancers. The proliferation of cells of the uterine 
endometrium, for example, is stimulated by estrogen, and exposure to excess 
estrogen significantly increases the likelihood that a woman will develop 
endometrial cancer. The risk of endometrial cancer is therefore substantially 
increased by long-term postmenopausal estrogen replacement therapy with 
high doses of estrogen alone. Fortunately, this risk is minimized by 
administration of progesterone to counteract the stimulatory effect of 
estrogen on endometrial cell proliferation. However, long-term therapy with 
combinations of estrogen and progesterone may lead to an increased risk of 
breast cancer. 

In addition to chemicals and radiation, some viruses induce cancer both in 
experimental animals and in humans. The common human cancers caused by 
viruses include liver cancer and cervical carcinoma, which together account 
for 10 to 20% of worldwide cancer incidence. These viruses are important 
not only as causes of human cancer; as discussed later in this chapter, studies 
of tumor viruses have played a key role in elucidating the molecular events 
responsible for the development of cancers induced by both viral and 
nonviral carcinogens. ♦ top 

Properties of Cancer Cells 

The uncontrolled growth of cancer cells results from accumulated 
abnormalities affecting many of the cell regulatory mechanisms that have 
been discussed in preceding chapters. This relationship is reflected in several 
aspects of cell behavior that distinguish cancer cells from their normal 
counterparts. Cancer cells typically display abnormalities in the mechanisms 
that regulate normal cell proliferation, differentiation, and survival. Taken 
together, these characteristic properties of cancer cells provide a description 
of malignancy at the cellular level. 

The uncontrolled proliferation of cancer cells in vivo is mimicked by their 
behavior in cell culture. A primary distinction between cancer cells and 
normal cells in culture is that normal cells display density-dependent 
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inhibition of cell proliferation ( Figure 15.8 V Normal cells proliferate until 
they reach a finite cell density, which is determined in part by the availability 
of growth factors added to the culture medium (usually in the form of 
serum). They then cease proliferating and become quiescent, arrested in the 
G 0 stage of the cell cycle (see Figure 14.6) . The proliferation of most cancer 

cells, however, is not sensitive to density-dependent inhibition. Rather than 
responding to the signals that cause normal cells to cease proliferation and 
enter G Q , tumor cells generally continue growing to high cell densities in 

culture, mimicking their uncontrolled proliferation in vivo. 

A related difference between normal cells and cancer cells is that many 
cancer cells have reduced requirements for extracellular growth factors. As 
discussed in Chapter 13 , the proliferation of most cells is controlled, at least 
in part, by polypeptide growth factors. For some cell types, particularly 
fibroblasts, the availability of serum growth factors is the principal 
determinant of their proliferative capacity in culture. The growth factor 
requirements of these cells are closely related to the phenomenon of density- 
dependent inhibition, since the density at which normal fibroblasts become 
quiescent is proportional to the concentration of serum growth factors in the 
culture medium. 

The growth factor requirements of many tumor cells are reduced compared to 
their normal counterparts, contributing to the unregulated proliferation of 
tumor cells both in vitro and in vivo. In some cases, cancer cells produce 
growth factors that stimulate their own proliferation ( Figure 15.9) . Such 
abnormal production of a growth factor by a responsive cell leads to 
continuous autostimulation of cell division (autocrine growth stimulation), 
and the cancer cells are therefore less dependent on growth factors from 
other, physiologically normal sources. In other cases, the reduced growth 
factor dependence of cancer cells results from abnormalities in intracellular 
signaling systems, such as unregulated activity of growth factor receptors or 
other proteins (e.g., Ras proteins or protein kinases) that were discussed in 
Chapter 13 as elements of signal transduction pathways leading to cell 
proliferation. 

Cancer cells are also less stringently regulated than normal cells by cell-cell 
and cell-matrix interactions. Most cancer cells are less adhesive than normal 
cells, often as a result of reduced expression of cell surface adhesion 
molecules. For example, loss of E-cadherin, the principal adhesion molecule 
of epithelial cells, is important in the development of carcinomas (epithelial 
cancers). As a result of reduced expression of cell adhesion molecules, 
cancer cells are comparatively unrestrained by interactions with other cells 
and tissue components, contributing to the ability of malignant cells to 
invade and metastasize. The reduced adhesiveness of cancer cells also results 
in morphological and cytoskeletal alterations: Many tumor cells are rounder 
than normal, in part because they are less firmly attached to either the 
extracellular matrix or neighboring cells. 

A striking difference in the cell-cell interactions displayed by normal cells 
and those of cancer cells is illustrated by the phenomenon of contact 
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inhibition ( Figure 15.10 V Normal fibroblasts migrate across the surface of a 
culture dish until they make contact with a neighboring cell Further cell 
migration is then inhibited, and normal cells adhere to each other, forming an 
orderly array of cells on the culture dish surface. Tumor cells, in contrast, 
continue moving after contact with their neighbors, migrating over adjacent 
cells, and growing in disordered, multilayered patterns. Not only the 
movement but also the proliferation of many normal cells is inhibited by cell- 
cell contact, and cancer cells are characteristically insensitive to such contact 
inhibition of growth. 

Two additional properties of cancer cells affect their interactions with other 
tissue components, thereby playing important roles in invasion and 
metastasis. First, malignant cells generally secrete proteases that digest 
extracellular matrix components, allowing the cancer cells to invade adjacent 
normal tissues. Secretion of collagenase, for example, appears to be an 
important determinant of the ability of carcinomas to digest and penetrate 
through basal laminae to invade underlying connective tissue (see Figure 
15.5) . Second, cancer cells secrete growth factors that promote the formation 
of new blood vessels ( angiogenesis ). Angiogenesis is needed to support the 
growth of a tumor beyond the size of about a million cells, at which point 
new blood vessels are required to supply oxygen and nutrients to the 
proliferating tumor cells. Such blood vessels are formed in response to 
growth factors, secreted by the tumor cells, that stimulate proliferation of 
endothelial cells in the walls of capillaries in surrounding tissue, resulting in 
the outgrowth of new capillaries into the tumor. The formation of such new 
blood vessels is important not only in supporting tumor growth, but also in 
metastasis. The actively growing new capillaries formed in response to 
angiogenic stimulation are easily penetrated by the tumor cells, providing a 
ready opportunity for cancer cells to enter the circulatory system and begin 
the metastatic process. 

Another general characteristic of most cancer cells is that they fail to 
differentiate normally. Such defective differentiation is closely coupled to 
abnormal proliferation, since, as discussed in Chapter 14 , most fully 
differentiated cells either cease cell division or divide only rarely. Rather 
than carrying out their normal differentiation program, cancer cells are 
usually blocked at an early stage of differentiation, consistent with their 
continued active proliferation. 

The leukemias provide a particularly good example of the relationship 
between defective differentiation and malignancy. All of the different types 
of blood cells are derived from a common stem cell in the bone marrow (see 
Figure 14.44 ). Descendants of these cells then become committed to specific 
differentiation pathways. Some cells, for example, differentiate to form 
erythrocytes whereas others differentiate to form lymphocytes, granulocytes, 
or macrophages. Cells of each of these types undergo several rounds of 
division as they differentiate, but once they become fully differentiated, cell 
division ceases. Leukemic cells, in contrast, fail to undergo terminal 
differentiation ( Figure 15.11) . Instead, they become arrested at early stages 
of maturation at which they retain their capacity for proliferation and 
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continue to reproduce. 

As discussed in Chapter 13 , programmed cell death , or apoptosis . is an 
integral part of the differentiation program of many cell types, including 
blood cells. Many cancer cells fail to undergo apoptosis, and therefore 
exhibit increased life spans compared to their normal counterparts. This 
failure of cancer cells to undergo programmed cell death contributes 
substantially to tumor development. For example, the survival of many 
normal cells is dependent on signals from growth factors or from the 
extracellular matrix that prevent apoptosis. In contrast, tumor cells are often 
able to survive in the absence of growth factors required by their normal 
counterparts. Such a failure of tumor cells to undergo apoptosis when 
deprived of normal environmental signals may be important not only in 
primary tumor development but also in the survival and growth of metastatic 
cells in abnormal tissue sites. Normal cells also undergo apoptosis following 
DNA damage, while many cancer cells fail to do so. In this case, the failure 
to undergo apoptosis contributes to the resistance of cancer cells to 
irradiation and many chemotherapeutic drugs, which act by damaging DNA. 
Abnormal cell survival, as well as cell proliferation, thus plays a major role 
in the unrelenting growth of cancer cells in an animal. ♦ top 

Transformation of Cells in Culture 

The study of tumor induction by radiation, chemicals, or viruses requires 
experimental systems in which the effects of a carcinogenic agent can be 
reproducibly observed and quantitated. Although the activity of carcinogens 
can be assayed in intact animals, such experiments are difficult to quantitate 
and control. The development of in vitro assays to detect the conversion of 
normal cells to tumor cells in culture, a process called cell transformation, 
therefore represented a major advance in cancer research. Such assays are 
designed to detect transformed cells, which display the in vitro growth 
properties of tumor cells, following exposure of a culture of normal cells to a 
carcinogenic agent. Their application has allowed experimental analysis of 
cell transformation to reach a level of sophistication that could not have been 
attained by studies in whole animals alone. 

The first and most widely used assay of cell transformation is the focus 
assay, which was developed by Howard Temin and Harry Rubin in 1958. 
The focus assay is based on the ability to recognize a group of transformed 
cells as a morphologically distinct "focus" against a background of normal 
cells on the surface of a culture dish ( Figure 15.12 ). The focus assay takes 
advantage of three properties of transformed cells: altered morphology, loss 
of contact inhibition, and loss of density-dependent inhibition of growth. The 
result is the formation of a colony of morphologically altered transformed 
cells that overgrow the background of normal cells in the culture. Such foci 
of transformed cells can usually be detected and quantified within a week or 
two after exposure to a carcinogenic agent. In general, cells transformed in 
vitro are able to form tumors following inoculation into susceptible animals, 
supporting in vitro transformation as a valid indicator of the formation of 
cancer cells, ♦top 
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ancestor of domestic dogs. Thus, dogs from 
these breeds may be the best living representa- 
tives of the ancestral dog gene pool. It is notable 
that several breeds commonly believed to be of 
ancient origin, such as the Pharaoh Hound and 
Ibizan Hound, are not included in this group. 
These are often thought to be the oldest of all 
dog breeds, descending directly from the ancient 
Egyptian dogs drawn on tomb walls more than 
5000 years ago. Our results indicate, however, 
that these two breeds have been recreated in 
more recent times from combinations of other 
breeds. Thus, although their appearance 
matches the ancient Egyptian sight hounds, 
their genomes do not. Similar conclusions ap- 
ply to the Norwegian Elkhound, which clus- 
ters with modern European breeds rather than 
with the other Arctic dogs, despite reports of 
direct descent from Scandinavian origins more 
than 5000 years ago (7, 24). 

The large majority of breeds appears to rep- 
resent a more recent radiation from shared Eu- 
ropean stock. Although the individual breeds are 
genetically differentiated, they appear to have 
diverged at essentially the same time. This radi- 
ation probably reflects the proliferation of dis- 
tinct breeds from less codified phenotypic vari- 
eties after the introduction of the breed concept 
and the creation of breed clubs in Europe in the 
1800s. A more sensitive cluster analysis was 
able to discern additional genetic structure of 
three subpopulations within this group. One con- 
tains Mastiff-like breeds and appears to reflect 
shared morphology derived from a common an- 
cestor. Another includes Shetland Sheepdog, the 
two Belgian Sheepdogs, and Collie, and may 
reflect shared ancestral herding behavior. The 
remaining population is dominated by a prolif- 
eration of breeds dedicated to various aspects of 
the hunt. For these breeds, historical and breed 
club records suggest highly intertwined blood- 
lines, consistent with our results. 

Dog breeds have traditionally been 
grouped on the basis of their roles in human 
activities, physical phenotypes, and histor- 
ical records. Here, we defined an indepen- 
dent classification based on patterns of ge- 
netic variation. This classification supports 
a subset of traditional groupings and also 
reveals previously unrecognized connec- 
tions among breeds. An accurate under- 
standing of the genetic relationships among 
breeds lays the foundation for studies 
aimed at uncovering the complex genetic 
basis of breed differences in morphology, 
behavior, and disease susceptibility. 
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and phosphatases (PTPs). Although a variety of 
PTK genes have been directly linked to tumor- 
igenesis through somatic activating mutations 
(3-6), only a few PTP genes have been impli- 
cated in cancer (7-10). Moreover, it is not 
known how many or how frequently members 
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Tyrosine phosphorylation, regulated by protein tyrosine phosphatases (PTPs) 
and kinases (PTKs), is important in signaling pathways underlying tumorigen- 
esis. A mutational analysis of the tyrosine phosphatase gene superfamily in 
human cancers identified 83 somatic mutations in six PTPs (PTPRF, PTPRG, 
PTPRT, PTPN3, PTPN13, PTPN14), affecting 26% of colorectal cancers and a 
smaller fraction of lung, breast, and gastric cancers. Fifteen mutations were 
nonsense, frameshift, or splice-site alterations predicted to result in truncated 
proteins lacking phosphatase activity. Five missense mutations in the most 
commonly altered PTP (PTPRT) were biochemically examined and found to 
reduce phosphatase activity. Expression of wild-type but not a mutant PTPRT 
in human cancer cells inhibited cell growth. These observations suggest that the 
mutated tyrosine phosphatases are tumor suppressor genes, regulating cellular 
pathways that may be amenable to therapeutic intervention. 
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of the PTP gene family are altered in any 
particular cancer type. We have systematically 
addressed these issues by comprehensive mu- 
tational analysis of the PTP gene superfamily in 
colorectal tumors. 

The PTP gene superfamily is composed of 
three main families: (i) the classical PTPs, in- 
cluding the receptor PTPs (RPTPs) and the 
nonreceptor PTPs (NRPTPs); (ii) the dual spec- 
ificity phosphatases (DSPs), which can dephos- 
phorylate serine and threonine in addition to 
tyrosine residues; and (iii) the low molecular 
weight phosphatases (LMPs) (/). Using an ap- 
proach similar to that described in a recent bioin- 
formatic analysis of PTPs in the mouse genome 
(7/), we employed a combination of Hidden 
Markov Models representing catalytic domains 
of members of the PTP superfamily to identify 
53 classical PTPs (21 RPTPs and 32 NRPTPs), 
33 DSPs, and one LMP in the human genome 
(12). This analysis revealed a set of genes rep- 
resenting all known human PTPs (13) as well as 
seven putative PTPs. 

As an initial screen to evaluate whether these 
phosphatases are genetically altered in human 
cancer, we analyzed the coding exons of all 87 
members of this gene superfamily in 18 colorec- 
tal cancers. A total of 1375 exons from all an- 
notated RPTPs, NRPTPs, DSPs, and LMPs were 
extracted from genomic databases (12). These 
exons were amplified by polymerase chain reac- 
tion from cancer genomic DNA samples and 
directly sequenced with dye terminator chemis- 
try (12). Whenever a presumptive mutation was 
identified, we attempted to determine whether it 
was somatically acquired (i.e., tumor specific) by 
examining the sequence of the gene in genomic 
DNA from normal tissue of the relevant patient. 

From the 3.3 Mb of sequence information 
obtained, we identified six genes containing so- 
matic mutations, including three members of the 
RPTP subfamily (PTPRF, PTPRG, and PTPRT) 
and three members of the NRPTP subfamily 
(P77W3, PTPN13, and PTPN14). These six 
genes were then further analyzed for mutations 
in another 1 57 colorectal cancers. Through this 
strategy, we identified 77 mutations in the six 
genes, in aggregate affecting 26% of the colo- 
rectal tumors analyzed (table SI and Fig. 1). 
Examination of these six genes in seven other 
tumor types identified PTPRT mutations in two 
of 1 1 (18%) lung cancers and two of 12 gastric 
cancers (17%), and PTPRF mutations in one of 
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1 1 (9%) lung cancers and one of 1 1 (9%) breast 
cancers. No mutations were identified in 12 pan- 
creatic cancers, 12 ovarian cancers, 12 medullo- 
blastomas, or 1 2 glioblastomas (table SI and Fig, 
1). In total, 83 nonsynonymous mutations were 
observed, all of which were somatic in the can- 
cers that could be assessed (12). 

Fifteen of the 83 mutations were nonsense, 
frameshift, or splice-site alterations, all of which 
were predicted to result in aberrant or truncated 
proteins. In 16 tumors both alleles of the phos- 
phatase gene appeared to be mutated, a charac- 
teristic often associated with tumor suppressor 
genes. The majority of tumors with PTP gene 
mutations also contained mutations in KRAS or 
BRAF, and nine tumors contained alterations in 
previously reported tyrosine kinase genes (table 
SI). Thus, the mutant phosphatases identified in 
this study are likely to operate through cellular 
pathways distinct from those associated with 
previously identified mutant kinases. 

Analysis of mutations in tumors is compli- 
cated by the fact that mutations can arise either 
as functional alterations affecting key genes un- 
derlying the neoplastic process or as nonfunc- 
tional "passenger" changes. The multiple waves 
of clonal expansion and selection that occur 
throughout tumorigenesis lead to fixation of any 
mutation that had previously occurred in any 
predecessor cell, regardless of whether the mu- 
tation was actually responsible for the clonal 
expansion. Two independent lines of evidence 
suggest that the sequence alterations we 
observed are functional. First, the ratio of non- 
synonymous to synonymous mutations pro- 
vides an indication of selection, as synony- 
mous alterations usually do not exert a growth 
advantage. There were no somatic synony- 
mous mutations detected in the colorectal can- 
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cers analyzed, resulting in a ratio of nonsyn- 
onymous to synonymous mutations of 77 to 0, 
much higher than the expected 2 : 1 ratio for 
nonselected passenger mutations (P < 1 X 
10~ 6 ). Second, the prevalence of mutations in 
the coding regions of the analyzed genes was 
~19 per Mb of tumor DNA, similar to the 
prevalence of functional somatic alterations 
observed in other gene families [e.g., the ty- 
rosine kinome (6)] and significantly higher 
than the prevalence of nonfunctional alter- 
ations previously observed in the cancer ge- 
nomes (~1 per Mb, P < 0.01) (14). These 
data support the idea that these mutations were 
the targets of selection during tumorigenesis. 

The great majority of the nonsense and 
frameshift mutations (Fig. I) would result in 
polypeptides devoid of C-terminal phosphatase 
catalytic domains, thereby leading to inactivation 
of the phosphatase. To evaluate whether tumor- 
specific point mutations alter phosphatase activ- 
ity, we biochemically tested mutant versions of 
PTPRT, the most frequently mutated PTP in the 
superfamily. Mutations in both intracellular PTP 
domains (Dl and D2) were evaluated. His- 
tagged versions of the catalytic region of wild- 
type PTPRT, two Dl mutants (Q987K and 
N1128I), and three D2 mutants (R1212W, 
R1346L, and T1368M) were produced in 
bacteria and analyzed for phosphatase activ- 
ity by using 6,8-difluoro-4-methylumbel- 
liferyl phosphate (DiFMUP) as a substrate 
(Fig. 2) (12). All Dl and D2 mutants had 
reduced phosphatase activity compared with 
the wild-type protein (Fig. 2). The kinetic 
parameter K cal was reduced in both Dl mu- 
tants, while K^, was increased in all three D2 
mutants, suggesting that mutations in the two 
domains may have different effects on enzy- 



Fig. 1. Distribution of mutations in PTPRT, PTPN13, PTP N 14, PTPRC, PTPRF, and PTPN3. Black 
arrows indicate location of missense mutations, red arrows indicate location of nonsense mutations 
or frameshifts, and boxes represent functional domains (B41, band 41; CA, carbonic anhydrase; 
FN3, fibronectin type III; IG, immunoglobulin; MAM, meprin/A5/PTPu,; PDZ, postsynaptic density, 
discs large, zonula occludans; PTPc, catalytic phosphatase domain). Black stars indicate PTPRT 
mutants evaluated for phosphatase activity (Fig. 2), and the red star indicates PTPRT mutant 
evaluated for effects on cell proliferation (Fig. 3). 
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matic activity. Although it has been thought 
that the D2 domain is usually catalytically 
inactive (/), our results are consistent with 
recent studies that show that the D2 domain is 
important for phosphatase activity in some 
receptor phosphatases (15). 

These biochemical data on missense muta- 
tions, coupled with the large number of truncat- 
ing mutations, suggested that PTPRT functions 
as a tumor suppressor gene. To determine 
whether PTPRT expression can inhibit tumor 
cell growth, we transfected wild-type PTPRT 
into HCT116 colorectal cancer cells (12). An 
identical expression vector containing an R632X 
mutant of PTPRT was used for comparison. 
Wild-type PTPRT potently inhibited cell growth 
in this assay, as seen by the substantial decrease 
in the number of neomycin-resistant colonies 
compared with the R632X mutant or with vector 
alone (Fig. 3, A and B). Similar results with 
wild-type and mutant PTPRT were also ob- 
served in DLD1 colorectal cancer cells. 

The combination of these genetic, biochem- 
ical, and cellular data suggest that PTPRT and 
the other identified phosphatases are likely to act 
as tumor suppressors. This is consistent with the 
function of other phosphatases implicated in 
tumorigenesis (7,8, 16) and with the general 
role of phosphatases in inhibiting various 
growth-promoting signaling pathways (2). 



The absence of biallelic mutations in a sub- 
set of the analyzed tumors suggests that 
some alterations may act in a dominant- 
negative fashion or may affect gene dosage, 
mechanisms that have been previously in- 
volved in inacttvation of other tumor sup- 
pressor genes (17, 18). 

Little is known about the functional role of 
the tyrosine phosphatases discussed here. 
PTPN 1 3 appears to be involved in apoptosis (19) 
and may be partly responsible for the antitumor 
effects of tamoxifen (20). Overexpression of 
PTPN3 inhibits growth of N[H/3T3 cells, possi- 
bly through interaction with valosin-containing 
protein (VCP/p97) (21). PTPN 14 and PTPRF 
are thought to play a role in cell adhesion by 
regulating tyrosine phosphorylation of adherens 
junction proteins (22, 23). Because increased 
phosphorylation of adherens junctions has been 
shown to increase cell motility and migration 
(22, 24), mutational inactivation of these genes 
may be an important step in cancer ceil invasion 
and metastasis. PTPRG maps to chromosome 
3pl4.2, a region frequently lost in lung, renal, 
and early-stage breast tumors, and is thought to 
be a target of the translocation at 3pl4 in familial 
renal cell carcinoma (25-27). However, no point 
mutations in PTPRG (28) or any of the other 
genes identified here have been previously de- 
scribed in any cancer. PTPRT is expressed in the 
developing central nervous system and in the 
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Fig. 2. Evaluation of phosphatase activity of 
mutant PTPRT. (A) Saturation kinetics of wild- 
type and mutant PTPRT. His-tagged versions of 
PTPRT protein segments comprising the two 
catalytic domains containing wild-type (WT) 
and tumor-specific mutant sequences were ex- 
pressed in bacteria and purified using nickel- 
affinity chromatography. Equal amounts of WT 
and mutant proteins were used to evaluate 
enzyme kinetics. The rate of hydrolysis of sub- 
strate (DiFMUP) is plotted against increasing 
substrate concentration. Data were fitted to 
the Michaelis-Menton equation, and the result- 
ing kinetic parameters of WT and mutant pro- 
teins are indicated in (B). 




WT R632X Vector 

Fig. 3. PTPRT overexpression suppresses growth 
of human cancer cells. (A) HCT116 colorectal 
cancer cells were transfected with wild-type 
(WT) PTPRT construct, truncated R632X mu- 
tant PTPRT construct, or empty pCI-Neo vector. 
The photographs show colonies stained with 
crystal violet after 14 days of geneticin selec- 
tion. (B) Number of resistant colonies (mean of 
two 25 cm 2 flasks) for WT PTPRT, mutant 
PTPRT, and empty vector. 



adult cerebellum (29) and had not been thought 
to play a role in the growth or differentiation of 
other tissues. We have found that PTPRT is 
expressed in a variety of human tissues, includ- 
ing normal colon epithelium as well as cells 
derived from colorectal cancers (fig. SI). 

Phosphatases affect signaling pathways that 
may be amenable to therapeutic intervention in 
cancer cells (2). Although reactivation of inca- 
pacitated phosphatases is likely to be pharmaco- 
logically challenging, identification of the corre- 
sponding kinases that phosphorylate substrates 
normally regulated by the mutant phosphatases 
could provide novel therapeutic targets. Like the 
analysis of genetic alterations in tyrosine kinases 
(6), the present study suggests the possibility of 
individualized therapy based on the mutant 
phosphatases present in specific tumors. Con- 
ceivably, this approach would have broad thera- 
peutic use, because more than 50% of colorectal 
tumors analyzed to date have alterations in at 
least one member of the tyrosine phosphatome 
or kinome. 
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Cell-cell adhesion regulates processes important in embryonal development, normal physiology, 
and cancer progression. It is regulated by various mechanisms including tyrosine phosphoryla- 
tion. We have previously shown that the protein tyrosine phosphatase Pez is concentrated at 
intercellular junctions in confluent, quiescent monolayers but is nuclear in cells lacking cell-cell 
contacts. We show here with an epithelial cell model that Pez localizes to the adherens junctions 
in confluent monolayers. A truncation mutant lacking the catalytic domain acts as a dominant 
negative mutant to upregulate tyrosine phosphorylation at adherens junctions. We identified 
j3-catenin, a component of adherens junctions, as a substrate of Pez by a "substrate trapping" 
approach and by in vitro dephosphorylation with recombinant Pez. Consistent with this, ectopic 
expression of the dominant negative mutant caused an increase in tyrosine phosphorylation of 
/3-catenin, demonstrating that Pez regulates the level of tyrosine phosphorylation of adherens 
junction proteins, including j3-catenin. Increased tyrosine phosphorylation of adherens junction 
proteins has been shown to decrease cell-cell adhesion, promoting cell migration as a result. 
Accordingly, the dominant negative Pez mutant enhanced cell motility in an in vitro "wound" 
assay. This suggests that Pez is also a regulator of cell motility, most likely through its action on 
cell-cell adhesion. 



INTRODUCTION 

Cell-cell adhesion regulates diverse cellular functions in- 
cluding cell proliferation, migration, and apoptosis (Vlem- 
inckx and Kemler, 1999). One important cell-cell adhesion 
system, the adherens junction (AJ), is mediated by a family 
of homophilic receptors, the cadherins (Steinberg and Mc- 
Nutt, 1999). The strength of cadherin-mediated adhesion is 
regulated by lateral clustering of cadherin molecules at the 
plasma membrane and also through the linkage of its intra- 
cellular cytoplasmic tail to the actin cytoskeleton. /3-Catenin, 
a structural component of AJs and signal transducer of the 
wnt signaling pathway, is crucial for cross-linking cadherins 
to the actin cytoskeleton through another intermediate, 
a-catenin (Gumbiner, 1995;Cowin and Burke, 1996). 
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Reversible tyrosine phosphorylation, catalyzed by the op- 
posing actions of protein tyrosine kinases (PTKs) and pro- 
tein tyrosine phosphatases (PTPs), is an important mecha- 
nism for regulating the linkage of cadherins to the 
cytoskeleton. A number of PTKs and PTPs have been found 
to be associated with AJs (Steinberg and McNutt, 1999). 
Inhibitors of PTPs have been shown to disrupt cell-cell ad- 
hesion, suggesting that PTPs play a critical role in maintain- 
ing the integrity of AJs (Ay a Ion and Geiger, 1997). The 
observation that phosphorylation of a critical tyrosine resi- 
due, Tyr654, on /3-catenin results in its dissociation from 
E-cadherin (Roura et al, 1999), verifies that tyrosine phos- 
phorylation is an important mechanism for regulating the 
E-cadherin-catenin linkage. Tyrosine phosphorylation has 
also been reported to disrupt the j3-catenin-a-catenin link- 
age (Ozawa and Kemler, 1998), although the critical ty- 
rosine(s) in this case has not been determined. These obser- 
vations suggest that multiple targets for tyrosine 
phosphorylation exist to regulate cell-cell adhesion. 

The PTP Pez (PTPD2/PTP36) is a 130-kDa cytosolic (non- 
transmembrane) PTP (Smith et al, 1995) expressed in a num- 
ber of tissues. It is a member of the FERM (four-point-one, 
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ezrin, radixin, moesin) family of PTPs characterized by a 
conserved N-terminal FERM domain (Chishti et al, 1998) 
and a C-terminal PTP catalytic domain separated by an 
intervening region. We recently showed that the subcellular 
localization of Pez is regulated in both HeLa and human 
umbilical vein endothelial cells (HUVEC); in cells grown to 
confluence Pez is localized to the cytosol, where it is con- 
centrated at intercellular junctions, but it is predominantly 
nuclear in sparsely plated cells that have not yet formed 
extensive cell-cell contacts (Wadham et al, 2000). Other fac- 
tors also regulate the subcellular localization of Pez, includ- 
ing TGF/3, which inhibits translocation of Pez from the cy- 
tosol to the nucleus, and serum, which promotes the 
accumulation of Pez in the nucleus (Wadham et al, 2000). 
Together these findings suggest that Pez could have multi- 
ple roles, involving the dephosphorylation of different sub- 
strates depending on whether it is in the nucleus or at 
intercellular junctions. Its presence at the intercellular junc- 
tions of confluent monolayers suggests that it may regulate 
the assembly or disassembly of adhesion complexes. 

To elucidate the function of Pez, we used a "substrate 
trapping" approach (Flint et al, 1997) in combination with 
the generation and overexpression of a dominant negative 
form of Pez to identify its substrates. We identified 0-catenin 
as a substrate and show that the dominant negative Pez 
enhances both tyrosine phosphorylation of adherens junc- 
tions and cell motility. 



MATERIALS AND METHODS 
Tissue Culture and Cell Lines 

MDCK and HEK 293 cell lines were cultured in DMEM supple- 
mented with 10% FBS. HUVECS were obtained from discarded 
umbilical cords and cultured in M199 medium supplemented with 
20% FBS and endothelial growth factors as previously described 
(Wall et al, 1978). Transient transfections of HEK293 cells were 
carried out using Lipofectamine 2000 (Invitrogen Life Technologies/ 
Carlsbad, CA) according to the manufacturer's instructions. Stable 
MDCK clones were generated by transfection by standard calcium 
phosphate coprecipitation and transfectants selected by G418 (Pro- 
mega, Madison, WI) resistance. 

Antibodies 

The polyclonal Pez antibody had previously been characterized 
(Wadham et al, 2000). All other antibodies used were purchased: 
monoclonal Flag epitope antibody (M2) from AM RAD Biotech (Vic- 
toria, Australia), /3-catenin, y-catenin, and E-cadherin monoclonal 
antibodies from Transduction Laboratories, a-catenin mAb from 
either Transduction Laboratories (Lexington, KY) or Zymed (San 
Francisco, CA), monoclonal antiphosphotyrosine antibody (PY100) 
from New England Biolabs (Beverley, MA), and polyclonal ZO-1 
and monoclonal pl20catenin antibody from Zymed. 

Generation of Mutant Pez Constructs 

Isolation of the human Pez cDNA and generation of a Flag epitope- 
tagged construct in the mammalian expression vector, pcDNA3 
(Invitrogen) has been described (Wadham ct al, 2000). The D 1079 A 
and R1127M mutations in Pez cDNA were made by site-directed 
mutagenesis using PCR. AFERM (amino acids 337-1187)- and APTP 
(amino acids 1-932) -Pez were generated by PCR using the appro- 
priate primers to remove the entire FERM or PTP domain, respec- 
tively. All constructs were tagged with the Flag epitope. The se- 



quences of all mutated constructs were verified by sequencing from 
both the sense and antisense directions. 

Generation of GST-Pez Fusion Proteins 

wt-Pez and ST-Pez coding sequences were excised from the 
pcDNA3 constructs described above and cloned into the pGEX 4T-1 
vector (Amersham Biosciences, Piscataway, NJ) to generate GST- 
fusion Pez proteins. The constructs were transformed into BL21- 
Codon Plus (DE3)-R1L Escherichia coli (Stratagene, La Jolla, CA) for 
protein expression. Cultures were induced with 0.15 mM IPTG for 
2 h at ambient temperature, and the GST-fusion proteins were 
purified on glutathione sepharose. The amounts of full-length fu- 
sion proteins produced were determined by Coomassie blue stain- 
ing after PAGE. Equal amounts of GST-wt-Pez and GST-ST-Pez 
protein were used for in vitro dephosphorylation of 0-catenin. 

Substrate Trapping 

Newly confluent HUVEC lysates, used as a source of tyrosine- 
phosphorylated proteins, were prepared as described (Flint et al, 
1997). Briefly, the cells were incubated for 30 min with 50 /xM 
sodium pervanadate to enrich for tyrosine-phosphorylated proteins, 
washed in phosphate-buffered saline (PBS), and lysed in ST buffer 
(50 mM HEPES, pH 7.5, 150 mM NaCl, 150 mM NaF, 10 mM sodium 
pyrophosphate, 5 mM EDTA, 1% Triton X-100, and protease inhib- 
itor cocktail [P2714, Sigma, St Louis, MOD containing 1 mM sodium 
orthovanadate at 4°C The lysates were incubated on ice for 30 min 
in the presence of 5 rrtM iodoacetic acid to irreversibly inactivate 
endogenous PTPs. Unreacted iodoacetic acid was inactivated with 
10 mM DTT. The lysates were then frozen on liquid nitrogen and 
stored at -70°C 

Flag-tagged wt-Pez or ST-Pez was transiently transfected into 
HEK293 cells and transfectants lysed in ST buffer 48 h after trans- 
fection. Equal amounts of protein from each lysate were immuno- 
precipitated (in the absence of orthovanadate) with an anti-Flag 
(M2) antibody precoupled to protein A sepharose beads. The Pez 
immunoprecipitates were washed three times in ST buffer, added to 
the phosphotyrosine enriched HUVEC lysates, and rocked at 4°C 
for 2 h. The beads were washed three times with ST buffer, boiled in 
Laemmli sample buffer, and bound proteins resolved by 8% SDS- 
PAGE. To detect tyrosine-phosphorylated proteins "pulled-down" 
by either wt or ST Pez, Western blots were performed using an 
antiphosphotyrosine antibody (PY100). 

Immunoprecipitations and Western Blots 

Immunoprecipitations were carried out after lysis of cells in ice-cold 
ST buffer containing 1 mM orthovanadate. Lysates were precleared 
with 20 /il of protein A-sepharose for 30 min at 4°C. Protein con- 
centration was assayed using Bradford Reagent from Bio-Rad (Her- 
cules, CA). Equal amounts (1-5 mg) of protein were incubated with 
2 /xg of primary antibodies supplemented with 20 fi\ packed protein 
A-sepharose for 1 h at 4°C. After washing, bound proteins were 
eluted by boiling in Laemmli sample buffer for 5 min separated by 
8% SDS-PAGE and transferred to PVDF membrane (Hybond-P, 
Amersham Pharmacia Biotech) for Western blotting. Western blot- 
ting was carried out after blocking with 5% milk, 0.1% Triton X-100 
in PBS using the indicated antibodies and developed using HRP- 
conjugated secondary antibody (Immunotech, Marseille, France) 
and ECL (Amersham Pharmacia Biotech). For quantitation, Western 
blots were developed with ECL-Plus (Amersham Pharmacia Bio- 
tech), and fluorescence intensity was imaged using a Molecular 
Dynamics Typhoon 9410 (Amersham Biosciences, United Kingdom) 
variable mode imager. 

Immunofluorescence 

MDCK stable cells lines expressing either wt-Pez, APTP-Pez, or 
AFERM-Pez were plated at confluent density onto fibronectin 
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coated glass LabTek chamber slides (Nalge, Nunc International, 
NapervUle, IL) and incubated for 2-3 d before staining. The cells 
were fixed in 4% paraformaldehyde /PBS for 10 min, quenched with 
10 mg/ml sodium borohydride for 15 min, and then permeabilized 
by treatment with 0.1% Triton X-100. Primary antibodies were used 
at 1:100 dilution and binding detected by incubation with either 
fluorophore-coupled secondary antibodies or biotinylated second- 
ary antibodies followed by fluorophore-conjugated streptavidin, as 
indicated (Molecular Probes, Eugene, OR). 

Epifluorescence microscopy was performed on an Olympus 
BX-51 microscope equipped with excitation filters for Alexa Fluor 
594/Texas red and fluorescein (494 nm), acquired to a Cool Snap FX, 
charge-coupled device (CCD) camera (Photometries, Phoenix, AZ). 
Images were adjusted for brightness and contrast with V 4 "*" soft- 
ware (Digital Optics Ltd., Auckland, New Zealand). The line-pro- 
filing feature of this software was used to plot the intensity vs. 
position of different fluorophores along a path through the cell 
monolayer, in cells that had been costained for two proteins. Con- 
focal microscopy was performed using a 60 X oil-immersion objec- 
tive on an Olympus 1X70 inverted microscope linked to a Bio-Rad 
Radiance 2100 confocal microscope. Sequential scans of each fluoro- 
phore separately were carried out for two-color colocalization stud- 
ies. 



Wounding Assay 

MDCK stable cell lines were plated onto six-well trays at densities 
that would give confluent monolayers after 24 h. Confluent mono- 
layers were incubated a further 48 h to allow intercellular junctions 
to mature before being serum-starved for 24 h. A linear wound was 
generated on the monolayers by scraping with the edge of a cell 
scraper. Unattached cells were washed off with agitation. Cells were 
photographed at the same point on a grid at the time of scraping 
and again at 24 h later. The difference in width of the wound 
between the two edges at the time of scraping and 24 h later was 
measured and represents the distance migrated. Each line was 
plated and wounded in triplicate. 



RESULTS 

Pez Colocalizes with E-cadherin at AJs 

We previously observed in confluent endothelial monolay- 
ers that endogenous Pez localizes to the intercellular junc- 
tions (Wadham et al, 2000). Here, we investigate the local- 
ization of Flag-tagged-Pez stably expressed in MDCK cells, 
a polarized epithelial cell line in which cell-cell adhesion is 
well characterized. For subsequent investigations of the 
function of Pez, truncation mutants of Pez that lack either 
the catalytic (APTP-Pez) or FERM (AFERM-Pez) domain 
(Figure 1A) were created and their subcellular localization 
when stably expressed in MDCK epithelial cell lines were 
also examined. wt-Pez and both truncated Pez mutants, 
examined by epifluorescent microscopy, were similarly lo- 
calized to what appears to be the intercellular junctions 
(Figure IB). To further confirm that the localization of Pez 
was indeed at intercellular junctions rather than at the cell 
surface, optical sectioning using a confocal microscope was 
performed on wt-Pez-MDCK cells that had been costained 
for the Flag-epitope (on Pez) and E-cadherin (a marker of 
AJs). The data showed that Pez precisely colocalized with 
E-cadherin at basolateral membranes both along the z-axis 
(Figure 1, C and D) and in the x-y plane (Figure IE), con- 
firming that it is localizing to the AJs. 



APTP-Pez Is a Potential Dominant Negative 
Mutant of Pez That Causes an Increase in Tyrosine 
Phosphorylation at AJs 

Because APTP-Pez is devoid of the catalytic domain and 
therefore not enzymatically active, but retains the ability to 
localize to intercellular junctions, it can potentially act as a 
dominant negative mutant. If Pez is an AJ PTP that regulates 
the level of tyrosine phosphorylation at AJs, then overex- 
pression of a dominant negative mutant of Pez should result 
in an increase in tyrosine phosphorylation of AJ proteins. 
This was investigated using confluent monolayers of MDCK 
cells overexpressing APTP-Pez. Cells were serum-starved 
followed by 10 min serum stimulation before staining with 
an antiphosphotyrosine antibody. Epifluorescence micros- 
copy snowed that there was a markedly higher level of 
tyrosine phosphorylation at intercellular junctions (marked 
by costaining with an anti-ZO-1 antibody) in APTP-Pez- 
transfected cells compared with the empty vector control or 
wt-Pez trans fectants (Figure 2A), Under these experimental 
conditions no tyrosine phosphorylation was detected at the 
intercellular junctions of empty vector- and wt-Pez-trans- 
fected cells. This is best demonstrated when the fluorescence 
intensities resulting from both the phosphotyrosine and 
ZO-1 antibodies were quantitated across several cell bound- 
aries (Figure 2 A, right column). 

To determine the exact location of the tyrosine-phospho- 
rylated proteins induced by overexpression of the putative 
dominant negative mutant APTP-Pez, optical sectioning 
with a confocal microscope was performed on confluent 
APTP-Pez-transfected cells costained with both the an- 
tiphosphotyrosine and anti-E-cadherin antibodies. As with 
Pez, the tyrosine phosphorylation induced by APTP-Pez 
precisely colocalized with E-cadherin both along the Z-axis 
(Figure 2, B and C) and in the x-y plane, confirming that the 
increased tyrosine phosphorylation occurred at AJs. In ad- 
dition, these data also suggest that the tyrosine-phosphory- 
lated substrates remained in the proximity of the plasma 
membrane and did not translocate to other parts of the cell 
Induction of tyrosine phosphorylation at intercellular junc- 
tions has also been confirmed using another potential dom- 
inant negative mutant, the R1127M point-mutant (Figure 
2D; R1127 of Pez is the equivalent of R221 of PTP1B, which 
when mutated leads to inactivation of its PTP activity (Flint 
etal, 1997). 

The AJ Protein p-Catenin Is a Substrate of Pez and 
Coimmunoprecipitates with Pez 

Data obtained so far suggested that Pez was an AJ PTP and 
in concordance with this hypothesis, a putative dominant 
negative mutant of Pez caused an increase in tyrosine phos- 
phorylation of AJs. A number of components of the AJ 
complex can be tyrosine phosphorylated, leading to alter- 
ations in their functions (Steinberg and McNutt, 1999). We 
therefore used a substrate trapping approach to identify 
substrates of Pez at the AJ. 

Aspl81 of PTP1B is an essential residue for catalytic ac- 
tivity of PTP1B, which when mutated to Ala results in a 
catalytically inactive substrate trapping (ST) mutant (Flint et 
al, 1997). Sequence alignment of the phosphatase domains 
of Pez and PTP IB indicate that Asp 1079 of Pez corresponds 
to Aspl81 of PTP1B. To generate a ST mutant of Pez (denot- 
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Figure 1. Flag-tagged wt-Pez A. 
and truncation mutants of Pez lo- 
calize to the AJs of confluent 1 340 
monolayers of MDCK cells. (A) HFERM I 
Schematic showing the structure 

of wt-, APTP-, and AFERM-Pez. 1 

(B) Epifluorescence microscopy of 
confluent monolayers of parental 
MDCK cells and stable MDCK 
cell lines expressing wt-, APTP-, 
and AFERM-Pez, labeled with an 
antibody against the Flag epitope 
and detected by indirect immuno- 
fluorescence using FITC-conju- 
gated anti-mouse antibody. Spe- 
cific labeling was observed in all 
Pez-expressing cell lines at the in- 
tercellular junctions. No staining 
was observed in the parental 
cells. (C) Z-series obtained by 
confocal laser scanning micros- 
copy showing wt-Pez-transfected 
MDCK cells double-stained with 
the anti-E-cadherin (top row) and 
anti-Flag (bottom row) antibodies 
to indicate colocalization of Pez 
with E-cadherin along the z-axis. 
Anti-E-cadherin antibody was Flag 
detected with phycoerythrin-con- (P©*) 
jugated anti-mouse IgG2a and an- 
ti-Flag antibody with biotinylated 
anti-mouse IgGl antibody and 
FITC-conjugated streptavidin. Z- 
steps were carried out at 0.5-^.m 
intervals, and alternate optical 
sections are shown. Top and bot- 
tom panels from each row show 
E-cadherin and Pez staining, re- 
spectively, from the same optical 
section. (D) Diagrammatic repre- 
sentation of the optical sections in 
the Z-series shown in C, indicat- 
ing x, y, and z axes. (E) High- 
resolution merged image of one 
optical section (represented by 
panels 7 and 7' from C) showing 
colocalization (yellow) in the x-y 
plane of Pez (green) and E-cad- 
herin (red). 
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ed ST-Pez), we mutated D1079 of Pez to Ala and verified 
using an in vitro assay with a tyrosine-phosphorylated pep- 
tide that the catalytic activity of ST-Pez was significantly 
reduced compared with that of wt-Pez (between 10-20% of 
wt-Pez activity; unpublished data). 

Without prior knowledge of the substrates of Pez, specific 
agonists could not be used to trigger their phosphorylation. 
Therefore, treatment with pervanadate, an inhibitor of PTPs, 
was used to upregulate tyrosine phosphorylation of proteins, 
including potential Pez substrates, in vivo (Figure 3A, panel 1). 
After lysis, endogenous PTPs in the HUVEC lysate were sub- 
sequently irreversibly inactivated by treatment with iodoacetic 
acid. The phosphotyrosyl-enriched HUVEC lysate devoid of 



endogenous PTP activities was incubated with either Flag- 
tagged wt- or ST-Pez (expressed in HEK 293 cells) immuno- 
precipitates bound to protein A-sepharose beads (Figure 3A, 
panel 6). Tyrosine-phosphorylated proteins that could interact 
with the wt- or ST-Pez immunoprecipitates, either directly or 
indirectly, were pulled-down and detected by Western blotting 
with, an antiphosphotyrosine antibody. A number of tyrosine- 
phosphorylated proteins of similar staining intensities were 
pulled-down by both wt- and ST-Pez immunoprecipitates (Fig- 
ure 3A, panel 2) but not immunoprecipitates from vector- 
transfected cells (panel 7). However, Band 1 was barely detect- 
able in wt-Pez pull-downs but was clearly present when 
associated with ST-Pez. This could be because Band 1 did not 
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Figure 2. APTP-Pez enhances ty- 
rosine phosphorylation of AJs. (A) 
Epifluorescence microscopy show- 
ing confluent monolayers of 
MDCK cells stably expressing vec- 
tor, wt- or APTP-Pez, double-la- 
beled with phosphotyrosine (PY) 
and ZO-1 antibodies and detected 
by indirect immunofluorescence 
using biotinylated anti-mouse anti- 
body followed by Texas Red-con- 
jugated strep tavidin and Alexa 
fluor 350-conjugated anti-rabbit 
antibody, respectively. From left to 
right columns: phosphotyrosine 
antibody staining, ZO-1 staining to 
show positions of cell-cell contacts, 
merged image of PY staining (false 
colored red) and ZO-1 staining 
(false colored green), and quantita- 
tion of PY and ZO-1 fluorescent in- 
tensities taken along a line connect- 
ing the crosses shown in the 
merged image. (B) Z-series ob- 
tained by confocal laser scanning 
microscopy showing APTP-Pez- 
transfected MDCK cells double- 
stained with the anti-E-cadherin 
(top row) and antiphosphotyrosine 
(bottom row) antibodies to show 
colocalization of phosphotyrosines 
with E-cadherin along the z-axis. 
Anti-E-cadherin antibody was de- 
tected with phycoeiythrin-conju- 
gated anti-mouse IgG2a, and an- 
tiphosphotyrosine antibody was 
detected with biotinylated anti- 
mouse IgGl antibody and F1TC- 
conjugated streptavidin. Z-steps 
were carried out at 0.5- ^im inter- 
vals, and alternate optical sections 
are shown. Top and bottom panels 
from each row show E-cadherin 
and phosphotyrosine staining, re- 
spectively, from the same optical 
section. (C) High-resolution 
merged image of one optical sec- 
tion (represented by panels 7 and 
7' of B) showing colocalization (yel- 
low) in the x~y plane of phosphoty- 
rosine (green) and E-cadherin (red) 
staining. (D) Epifluorescence mi- 
croscopy of confluent monolayers 
of parental MDCK or a stable 
MDCK cell-line expressing a Pez 
mutant, R1127M, stained with the 
phosphotyrosine antibody. 



bind sufficiently stably to wt-Pez to be pulled-down or because 
the associated protein had been dephosphorylated by the cat- 
alytically active wt-Pez but not the inactive ST-Pez. Both causes 
for its absence in the wt-Pez pull-downs are consistent with 
Band 1 being a specific substrate of Pez. 

Because the molecular weight of Band 1 is similar to 
that of the AJ protein /3-catenin, we probed parallel lanes 
with a /3-catenin antibody. /3-Catenin was found to be 



pulled-down by both wt- and ST-Pez (Figure 3A, panel 3) 
and furthermore comigrated exactly with Band 1, suggest- 
ing that the substrate at this position could indeed be 
/3-catenin. Thus, if indeed Band 1 is jS-catenin, it was only 
tyrosine phosphorylated when in association with inac- 
tive ST-Pez, suggesting that the lack of tyrosine phosphor- 
ylation in the wt-Pez-associated protein was due to de- 
phosphoryiation by wt-Pez. 
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Figure 3. Identification of Pez substrates and interacting proteins 
by substrate trapping and by in vitro dephosphorylation. (A) Sub- 
strate trapping. HUVEC lysate enriched for tyrosine-phosphory- 
lated proteins (panel 1) was incubated for 2 h with Sepharose-bound 
anti-Hag immunoprecipitates from cells transfected with wt-Pez, 
ST-Pez, or empty vector as shown. The beads were then washed, 
and bound proteins were eluted and immunoblotted with various 
antibodies as indicated. (B) In vitro dephosphorylation of /3-catenin 
by GST-Pez. HUVEC lysate enriched for tyrosine-phosphorylated 
proteins (as in A) was incubated with either GST-wt-Pez or GST- 
ST-Pez bound to glutathione sepharose beads. After removal of the 
GST-Pez fusion proteins, /3-catenin was immunoprecipitated, and 
the immunoprecipitates were Western blotted with an antiphospho- 
tyrosine antibody. The amount of tyrosine-phosphorylated 0-cate- 
nin was quantitated by fluorimaging. The blots were then stripped, 
counterblotted with anti-0-catenin antibody, and quantitated for 
total 0-catenin. The ratio of tyrosine-phosphorylated j3-catenin to 
total /3-catenin for each GST-Pez treatment is shown (one represen- 
tative of three experiments). 

After stripping and reprobing the filters with antibodies to 
other components of junctional adhesion complexes, we 
identified a number of other junctional proteins, including 
a-catenin (band 2, Figure 3A, panel 4), ZO-1 (band 3, Figure 
3A, panel 5), and plakoglobin (unpublished data), which 
interact with Pez but do not appear to be substrates for its 
PTPase activity. 0-Catenin, a-catenin, and plakoglobin are 
all components of the AJ complex. ZO-1 is normally associ- 
ated with tight junctions in polarized epithelial and endo- 
thelial monolayers but in newly confluent HUVEC, which 
have not formed bona fide tight junctions, it is also associ- 
ated with AJs (Stevenson and Keon, 1998), suggesting that a 
complex of AJ proteins may be binding to Pez. 

The phosphotyrosyl-enriched HUVEC lysate that was the 
source of phospho- /3-catenin used in the substrate- trapping 
approach was made devoid of any active PTPs by iodoac- 
etate treatment. However, there is a possibility that the 



wt-Pez immunoprecipitates incubated with the lysate may 
contain other active PTPs in addition to Pez. To demonstrate 
that phospho-|3-catenin can be directly dephosphorylated by 
Pez, we carried out a similar experiment using recombinant 
Pez, prepared as a GST fusion protein from E. coli. As a 
control, we performed the same reaction with the inactive 
GST-ST-Pez. After incubation of recombinant Pez with the 
HUVEC lysate, |3-catenin was immunoprecipitated with an- 
ti-/3-catenin antibody and the amount of tyrosine-phospho- 
rylated /3-catenin remaining was quantitated by fluorimager 
analysis of phosphotyrosine immunoblots. Although the 
GST fusion proteins expressed poorly in E. coli (>90% of the 
products were degraded, presumably because the large fu- 
sion protein [—160 kDa] was poorly folded), the data 
showed that GST-wt-Pez, but not GST-ST-Pez, dephospho- 
rylated the /3-catenin in the lysate, removing about one third 
of the tyrosylphosphates on the 0-catenin (Figure 3B). GST- 
Pez proteins lacking the FERM domain were also expressed 
and assayed, showing that the forms with wt but not ST 
catalytic domains dephosphorylate /3-catenin to a similar 
extent as full-length GST-wt-Pez (unpublished data). 

These experiments indicated that Pez can directly dephos- 
phorylate j3-catenin in vitro. To further assess whether Pez 
and j3-catenin<interact in vivo, we investigated whether Pez 
can be coimmunopreciptated with /3-catenin. Immunopre- 
cipitation of endogenous /3-catenin from cell lysates contain- 
ing either endogenous or ectopically expressed Pez was 
carried out, followed by Western blotting to determine if Pez 
is coimmunoprecipitated. Endogenous Pez was detected in 
/3-catenin immunoprecipitates when a /3-catenin antibody, 
but not a control antibody, was used to immunoprecipitate 
/3-catenin from confluent HUVEC monolayers (Figure 4A). 
Similarly, ectopically expressed Pez coimmunoprecipitated 
with j3-catenin in cells that were transfected with an expres- 
sion vector bearing Pez cDNA but not empty vector (Fig, 
4B), confirming that Pez and /3-catenin interact in vivo. 

Although /3-catenin coimmunoprecipitated with both en- 
dogenous and ectopic Pez, the coimmunoprecipitation ap- 
peared to be relatively weak. This could have a number of 
possible explanations. First, the coimmunoprecipitations 
were carried out in 1% Triton X-100 with a brief pulse of 
sonication to maximize recovery of Pez. These conditions 
may be too harsh for the proteins to remain bound. In 
preliminary experiments to test the strength of the interac- 
tion between Pez and /3-catenin, we have observed better 
coimmunoprecipitations if the sonication step was omitted 
although more striking increases in coimmunoprecipitation 
were observed by reducing the concentration of detergent 
used in the lysis buffer from 1% to 0.5% Triton X-100 (Figure 
4B, right panel). This suggests that the complex formed may 
be detergent labile. Such detergent lability has been well 
documented with pl20 catenin (pl20ctn)-cadherin interac- 
tions, whereby only 5-20% of the pl20ctn in detergent ly- 
sates is associated with cadherin in contrast to its almost 
complete localization to AJs or membrane fraction under 
detergent-free conditions (reviewed in Anastasiadis and 
Reynolds, 2000). Finally, it is likely that not all the Pez within 
the cell is localized to the AJ. This has certainly been ob- 
served in endothelial cells where Pez expression is observed 
in the cytosol away from the cell junctions, even when the 
monolayer is confluent (Wadham et al, 2000). This is also 
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Figure 4. Coimmunoprecipitations of endogenous Pez and ectopi- 
cally expressed wt- and APTP-Pez with endogenous /3-catenin and 
pl20ctn. (A) HUVEC lysate was incubated with 0-catenin antibody 
or control isotype-matched irrelevant antibody, and immunopre- 
cipitates were Western blotted with Pez or /3-catenin antibodies as 
indicated; right panel shows Western blots of total lysate. (B) Left 
panel: Empty vector, wt-Pez, or APTP-Pez were transiently trans- 
fected into HEK293 cells. 0-Catenin was immunoprecipitated from 
the lysates of transfectants and Western blotted using a Flag-epitope 
antibody to detect ectopically expressed Pez. The blot was stripped 
and reblotted with a /3-catenin antibody showing the presence of 
/3-catenin in all three immunoprecipitations. Right panel as in left 
panel, except that lysis was carried out in 0.5% instead of the 1% 
Triton X-100, which was used in all the other immunoprecipitations 
shown. (C) Flag epitope-tagged Pez was immunoprecipitated with 
an anti-Flag antibody from HEK 293 cells stably transfected with 
either empty vector or Flag epitope-tagged Pez. The immunopre- 
cipitates were Western-blotted with an anti-pl20ctn antibody and 
reblotted with the anti-Flag antibody. 



particularly true when Pez is ectopically expressed in HEK 
293 cells (unpublished data). 

Because a number of AJ proteins were also pulled-down 
together with /3-catenin (Figure 3A) and because of their 
structural relatedness, we also investigated whether pl20ctn 
coimmunoprecipitates with Pez. Immunoprecipitates from 
lysates of vector- or Pez-transfected stable HEK 293 cell lines 
carried out using the Flag-epitope antibody were Western 
blotted with a pl20ctn antibody. pl20ctn coimmunoprecipi- 
tated with Pez that was immunoprecipitated with the Flag 



antibody (Figure 4C). Western blots of total lysates from 
both the vector- and Pez-transfected cells showed that HEK 
293 cells expressed two major pl20ctn isoforms of similar 
abundance (Figure 4C), with the larger isoform correspond- 
ing to full-length pl20ctn (isoform 1). Interestingly, only the 
smaller MW isoform (~95 kDa) coimmunoprecipitated with 
Pez. Isoforms 1 and 3 are the most commonly expressed 
(reviewed in Anastasiadis and Reynolds, 2000), hence the 
smaller isoform coimmunoprecipitating with Pez is most 
likely isoform 3, although this remains to be confirmed. It is 
unclear why Pez is only associated with one isoform of 
pl20ctn, and it is also unkown at this stage whether pl20ctn 
interacts directly with Pez and whether it is a substrate. 
These will be the subject of future studies. What is clear, 
however, is that the cell junctional proteins that are pulled- 
down with Pez are likely to be specific because the coimmu- 
noprecipitation discriminated between the two highly re- 
lated isoforms of pl20ctn. 

APTP-Pez Interacts with and Induces Tyrosine 
Phosphorylation of fi-Catenin 

If APTP-Pez acts as a dominant negative mutant of Pez (as 
suggested by its localization to the AJ [Figure IB] and by its 
ability to induce tyrosine phosphorylation of proteins at the 
AJ [Figure 2]), and if /3-catenin is a bona fide substrate of Pez, 
then APTP-Pez should interact with /3-catenin in vivo to 
increase its tyrosine phosphorylation. To investigate the 
ability of APTP-Pez to interact with |3-catenin in vivo, /3-cate- 
nin was immunoprecipitated from HEK293 cells stably 
transfected with APTP-Pez. The j3-catenin immunoprecipi- 
tates were then Western blotted with an anti-Flag antibody 
to detect APTP-Pez that has coimmunoprecipitated with 
/3-catenin. The data showed that APTP-Pez was able to co- 
immunoprecipitate with /3-catenin to the same extent as 
wt-Pez (Figure 4B, left panel). 

To examine whether APTP-Pez can induce tyrosine 
phophorylation of Pez substrates through a dominant neg- 
ative effect, antiphosphotyrosine Western blots were per- 
formed on extracts from confluent monolayers of MDCK 
cells stably expressing empty vector, wt-Pez, or APTP-Pez. 
Cells were serum-starved for 24 h before addition of serum 
for 10 min to induce tyrosine phosphorylation. To see spe- 
cific tyrosine phosphorylation of Pez substrates, cells were 
not pretreated with pervanadate, which would have caused 
a global increase in tyrosine phosphorylation. Thus, in con- 
cordance with other studies (Ayalon and Geiger, 1997), the 
basal level of tyrosine-phosphorylated proteins in vector- 
transfected cells was very low. Two bands that were specif- 
ically phosphorylated in extracts from APTP-Pez but not 
empty vector or wt-Pez transfected cells (Figure 5A, closed 
arrowheads) were observed. The lower MW band comi- 
grated with j3-catenin when the filter was counterb lotted 
with a /3-catenin antibody, suggesting that one of the pro- 
teins that is tyrosine phosphorylated through overexpres- 
sion of APTP-Pez is /3-catenin. Immunoprecipitation of 
j3-catenin followed by Western blotting with an antiphos- 
photyrosine antibody confirmed that /3-catenin was indeed 
phosphorylated in APTP-Pez-transfected but not empty vec- 
tor-transfected cells (Figure 5B). These data indicate that 
APTP-Pez acts as a dominant negative mutant of Pez leading 
to an increased level of /3-catenin tyrosine phosphorylation. 
The finding that APTP-Pez could interact with and increase 



2526 Molecular Biology of the Cell 



PTP Pez Regulates Adherens Junctions 



I. MDCK 
WB: PY Ab pcatenlnAb 



mW 

kOa 
173— 








111 — 
80 — 


. ■ 








61 — 










49 — 












eatenin 



WB: p<atenin 
Ab 



Figure 5. APTP-Pez enhances tyrosine phosphorylation of /3-cate- 
nin. (A) MDCK cell lines stably expressing vector, wt-Pez, or APTP- 
Pez were serum-starved for 24 h followed by 10-min stimulation 
with 10% FBS to induce tyrosine phosphorylation. Lysates were 
Western blotted with a phosphotyrosine (PY) antibody (left). Ar- 
rowheads show two proteins specifically tyrosine phosphorylated 
in the APTP-Pez transfected cells, one of which comigrates with 
/3-catenin (double arrowhead) when the blot was stripped and 
reblotted with a 0-catenin antibody (right). (B) Lysates from empty 
vector- or APTP-Pez-transfected MDCK cells were immunoprecipi- 
tated with a /3-catenin antibody followed by Western blotting of the 
immunoprecipitates with an antiphosphotyrosine antibody (top). 
The blot was stripped and counterprobed with a /3-catenin antibody 
(bottom). 



the tyrosine phosphorylation status of /3-catenin further re- 
inforces our conclusion that /3-catenin is a bona fide Pez 
substrate. The presence of another protein with increased 
tyrosine phosphorylation in APTP-Pez but not wt-Pez or 
empty vector-transfected cells (Figure 5A, open arrowhead) 
suggests that there are other Pez substrates that were not 
identified by substrate trapping. Any additional substrates 
are likely to also be located at the intercellular junctions 
because the tyrosine phosphorylation induced by APTP-Pez 
was highly specific to intercellular junctions (Figure 2A). 
The presence of other substrates at intercellular junctions in 
addition to /3-catenin would also account for the dramatic 
increase in tyrosine phosphorylation at intercellular junc- 
tions in APTP-Pez MDCK cells. Similarly, the tyrosine-phos- 
phorylated band comigrating with /3-catenin in Figure 5A 
may be comprised of more than one protein of the same 
relative mobility. The higher degree of tyrosine phosphory- 
lation relative to /3-catenin protein in the APTP-Pez lysate 
shown in Figure 5A compared with that in the /3-catenin 
immunoprecipitates in Figure 5B (where the observed phos- 
phorylation is due solely to /3-catenin) suggests that this may 
be the case. 



Overexpression of the Dominant Negative Mutant 
(APTP-Pez) Enhances Cell Migration 
Tyrosine phosphorylation of /3-catenin has been correlated 
with increased cell migration in a number of studies (Liu et 
al, 1997;Muller et al, 1999;Hollande et al, 2001). We used an 
in vitro "wound" assay to investigate whether the increase 
in tyrosine phosphorylation of /3-catenin that results from 
overexpression of the dominant negative mutant, APTP-Pez, 
could affect rates of cell migration. In this assay, a linear 
scratch was made on a confluent monolayer of MDCK cells 
to generate a linear denuded area, after which cells from the 



edge of the wound could migrate into the denuded area to 
repopulate it. After 24 h, cells overexpressing APTP-Pez had 
migrated further into the wound than cells overexpressing 
either empty vector or wt-Pez (Figure 6A). Measurements of 
the distances migrated after 24 h (Figure 6B) showed that the 
average distance migrated by the APTP transfectants was 
significantly greater (p = 0.02) than the distance migrated by 
either wt-Pez or vector control cells. There was no significant 
difference between the distances migrated by the wt-Pez 
cells and empty vector-transfected cells (p = 0.08). Higher 
resolution images of the "wound" shows that the cells at the 
edge are migrating into the wound characterized by protru- 
sions into the wound and formation of pseudopodia (Figure 
6C). Our observation that the dominant negative mutant, 
APTP-Pez, enhanced cell motility suggests that Pez is a 
regulator of cell motility, most likely through its role in 
regulating cell-cell adhesion. 



DISCUSSION 

Regulation of the integrity of AJs in response to external 
cues is important for proper tissue and organ formation 
during embryonal development and for wound healing in 
adults. The participation of tyrosine phosphorylation in reg- 
ulation of AJ function is apparent from the involvement of 
PTKs such as Src (Behrens et al, 1993;Fujita et al, 2002), EGF 
receptor, and HGF/scatter factor receptor (Shibarnoto et al, 
1994) in phosphorylation of AJ proteins, leading to de- 
creased cell-cell interaction and concomitant enhancement 
of cell migration. A number of PTPs have also been found to 
be components of the AJ complex, including the receptor 
PTPs ix, k, and LAR and the cytosolic PTP, PTP1B (reviewed 
in (Steinberg and McNutt, 1999). 

In this study, we have identified the PTP Pez as a novel 
PTP of AJs. A truncation mutant of Pez lacking the catalytic 
domain acted as a dominant negative mutant to enhance 
tyrosine phosphorylation of AJs and promote cell migration. 
Analysis of the proteins that are tyrosine phosphorylated as 
a result of the overexpression of the dominant negative 
mutant suggested that there are at least two Pez substrates 
in epithelial cells, one of which is the AJ protein /3-catenin. 
Using a substrate trapping mutant to isolate potential Pez 
substrates followed by in vitro dephosphorylation of /3-cate- 
nin by recombinant Pez, we confirmed that /3-catenin is 
indeed a substrate of Pez. Dephosphorylation of /3-catenin 
by recombinant Pez in the absence of any other active PTP 
also demonstrated that Pez could directly dephosphorylate 
/3-catenin. Both endogenous and ectopically expressed Pez 
coimmunoprecipitated with endogenous /3-catenin, indicat- 
ing that they interact in vivo, providing further evidence 
that Pez is a physiological regulator of /3-catenin tyrosine 
phosphorylation. 

The highly similar complement of AJ proteins pulled- 
down by both wt-Pez and ST-Pez, the observation that wt- 
Pez and ST-Pez can pull-down /3-catenin equally well, and 
the observation that wt-Pez can pull-down unphosphory- 
lated /3-catenin all suggest that Pez is likely to be a compo- 
nent of the AJ complex. This raises a number of questions. 
First, although we have demonstrated that Pez could di- 
rectly dephosphorylate /3-catenin, the question remains as to 
whether its association with the AJ complex is solely 
through binding to /3-catenin or through binding to some 
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Figure 6, APTP-Pez enhances 
cell motility. (A) Confluent mono- 
layers of MDCK cells stably ex- 
pressing empty vector, wt-Pez, or 
APTP-Pez were wounded using 
the edge of a cell scraper and pho- 
tographed immediately (Oh). Each 
wound was photographed again 
at the same spot 24 h later. (B) The 
distance between the wound 
edges was measured at the same 
point on each wound at 0 h and at 
24 h. The difference in distance 
between the two edges at 24 h 
and 0 h was taken to be the dis- 
tance migrated in 24 h. Each cell 
line was plated and wounded in 
triplicate and the experiment per- 
formed at least twice with similar 
outcomes; data from one repre- 
sentative experiment are pre- 
sented. **p = 0.02 when distance 
migrated by APTP- expressing 
cells was compared with cells ex- 
pressing empty vector. (C) Higher 
magnification image of APTP-Pez 
MDCK cells at the wound edge 
24 h after wounding. 



other component of the A J complex. Second, if Pez is a 
component of the AJ through direct binding to |3-catenin or 
some other protein, than one might expect that any tyrosine 
phosphorylation of 0-catenin at the AJ will be very transient. 
To achieve longer-term loosening of the AJ might then re- 
quire downregulation of Pez activity. 

Of the PTPs that have previously been shown to be local- 
ized to the AJ, PTP LAR and PTP IB have been shown to 
dephosphorylate /3-catenin, whereas the substrates of PTPjm 
are yet to be identified. There are potentially many reasons 
for multiple PTPs to be associated with AJs. These include 
cell type-specific expression of some PTPs, different degrees 
of responsiveness to external stimuli and different substrate 
specificities exhibited by different PTPs. In the case of /3-cate- 
nin, the crystal structure indicates there are potentially up to 
14 tyrosines that are accessible for phosphorylation. One of 
these, Tyr654, has been demonstrated to regulate the bind- 
ing of j3-catenin to E-cadherin (Roura et al, 1999), but the 
phospho-tyrosine that interrupts a-catenin binding is yet to 
be determined. It is conceivable that dephosphorylation of 
different tyrosines that mediate different functions on the 
one molecule may be regulated by different PTPs. A com- 
prehensive analysis of substrate specificity and responses to 
external stimuli for individual PTPs, which to date has not 
been carried out, is essential to fully elucidate the role of 
tyrosine phosphorylation in regulating AJ functions. 

Finally, some studies have reported that after the tyrosine 
phosphorylation of /3-catenin and its dissociation from in- 
tercellular junctions, it is translocated into the nucleus where 
it can, under some circumstances, interact with the LEF-1/ 



Tcf transcription factor to alter gene expression (Adam et al, 
2001; Kim and Lee, 2001; Monga et al, 2002). What is not 
clear from these studies is whether /3-catenin is translocated 
into the nucleus in its tyrosine-phosphorylated form and if 
so, whether the tyrosine-phosphorylated form can interact 
with LEF-1 /Tcf. Intriguingly, under conditions where cell- 
cell adhesion is disrupted, we have shown that Pez also 
translocates into the nucleus (Wadham et al, 2000). It would 
be particularly important to determine whether Pez and 
0-catenin interact in the nucleus and if so, what the func- 
tional consequence of this interaction is. 
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